Development and validation of a multifactorial risk prediction model for breast cancer patients with co-occurring thyroid cancer: a retrospective matched case-control study
Junming Yin, Zhiwei Guo, Wen Yi, Ying He, Yi Luo, Kepeng Zhu, Songlin Yuan, Guocheng Du

TL;DR
This study developed a machine learning model to predict the risk of thyroid cancer in breast cancer patients, using factors like radiotherapy history and hormone levels.
Contribution
A novel XGBoost-based risk prediction model for co-occurring thyroid cancer in breast cancer patients was developed and validated.
Findings
XGBoost model achieved high AUC (0.874) and accuracy (86.7%) in predicting thyroid cancer co-occurrence.
Radiotherapy history, elevated TSH, ER-positive status, family history of thyroid cancer, and younger age were significant risk factors.
The model showed enhanced performance in patients with a history of radiotherapy (AUC = 0.921).
Abstract
To develop and validate a multifactorial machine learning model predicting thyroid cancer (TC) co-occurrence risk in breast cancer (BC) patients. This single-center retrospective matched case-control study analyzed 400 BC patients (200 with co-occurring TC, 200 matched BC-only controls) diagnosed between 2012-2025. Predictors included demographic, clinical, hormonal, and tumor biological variables. After feature selection via LASSO regression to handle multicollinearity, four machine learning algorithms (logistic regression, random forest, XGBoost, SVM) were developed and optimized using Bayesian hyperparameter tuning with 5-fold cross-validation. Model performance was evaluated on a 30% independent test set using AUC-ROC, calibration curves, and decision curve analysis. Multivariate analysis identified independent risk factors for TC co-occurrence: radiotherapy history (aOR = 3.42,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsThyroid Cancer Diagnosis and Treatment · Breast Cancer Treatment Studies · BRCA gene mutations in cancer
