A Machine Learning Framework for Breast Cancer Treatment Classification Using a Novel Dataset
Md Nahid Hasan, Md Monzur Murshed, Md Mahadi Hasan, and Faysal A. Chowdhury

TL;DR
This study develops machine learning models using a novel breast cancer dataset to predict treatment options, demonstrating high accuracy and interpretability, thus supporting personalized treatment decisions.
Contribution
Introduces a machine learning framework with novel dataset application for predicting breast cancer treatment, emphasizing model interpretability and performance evaluation.
Findings
Gradient Boosting Machine achieved 77.18% accuracy
XGBoost and AdaBoost also showed strong performance
Model interpretability was enhanced using SHAP values
Abstract
Breast cancer (BC) remains a significant global health challenge, with personalized treatment selection complicated by the disease's molecular and clinical heterogeneity. BC treatment decisions rely on various patient-specific clinical factors, and machine learning (ML) offers a powerful approach to predicting treatment outcomes. This study utilizes The Cancer Genome Atlas (TCGA) breast cancer clinical dataset to develop ML models for predicting the likelihood of undergoing chemotherapy or hormonal therapy. The models are trained using five-fold cross-validation and evaluated through performance metrics, including accuracy, precision, recall, specificity, sensitivity, F1-score, and area under the receiver operating characteristic curve (AUROC). Model uncertainty is assessed using bootstrap techniques, while SHAP values enhance interpretability by identifying key predictors. Among the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Breast Cancer Treatment Studies · Radiomics and Machine Learning in Medical Imaging
