A data-driven approach to predicting breast cancer recurrence with hybrid machine learning models
Deepa B.G.,Velmurugan R.,Narender M.,Suhaas K.P
Abstract
Breast cancer recurrence is one of the most significant medical concern, and accurate recurrence models can assist in early intervention and treatment planning. Breast cancer recurrent remains as one of the most critical concern for patients prognosis and treatment planning. Accuracy Predicting individual recurrence risk is crucial for the development of precise therapy, specially for those patients with high-risk profiles. In the study proposes a hybrid machine learning approach that uses the computational modeling and the medical information to predict the recurrence of breast cancer in a patient. The dataset contains the medical and patient information like the age, tumor size, lymph node involvement, malignancy degree, location, irradiation status and recurrence class. This proposed approach begins with the process of data processing, handling the missing data values, features normalization and encoding of categorical variable into numerical format. The dataset is divided into two parts the training set and the testing set and the two selected models' random forest and logistic regression models are trained independently. The predictions form both the model is stacked and a logistic regression meta-model is trained on these combined predictions. The evaluation of the model was conducted using the metrics such as accuracy, precision, recall, and F1 score. The designed hybrid model was able to achieve the accuracy of 97.66% with the precision, recall and F1 score all reaching around 98.15%. This study highlights the potential of hybrid machine learning techniques, improving the accuracy and reliability of machine learning models for breast cancer recurrence prediction. This development model can serve as a valuable tool for the medical industry to support decision making and assist in personalized treatment decisions, offering early detection of recurrence. This can enhance the treatment of a patient by supporting early detection and patients' outcomes through targeted therapy.
Key Words
breast cancer; logistic regression; machine learning; recurrence; random forest
Address
Deepa B.G. — Department of Computer Science, Christ University, Bangalore, India
Velmurugan R. — Department of Computer Science, Kristu Jayanti (Deemed to be University), Bangalore, India
Narender M., Suhaas K.P — Department of Computer Science and Engineering, The National Institute of Engineering, Mysore, India
PDF Viewer
Preview uses the same access rules as Full Text PDF (subscription, purchase, or open access).