Integrative Prognostic Modeling of Breast Cancer Survival with Gene Expression and Clinical Data
Robert Amevor, Emmanuel Kubuafor, Dennis Baidoo, Junaidu Salifu, Koshali Muthunama Gonnage, Onyedikachi Joshua Okeke

TL;DR
This study develops an integrative model combining gene expression and clinical data to accurately predict breast cancer survival, significantly outperforming models based on clinical data alone.
Contribution
The paper introduces a novel integrative prognostic model that combines gene expression signatures with clinical variables using penalized Cox regression and machine learning techniques.
Findings
The integrative model achieved a C-index of 0.922 and 36-month AUC of 0.94.
Gene expression data significantly improved survival prediction over clinical data alone.
Top predictive genes identified include OR2T27, TBATA, LINC01165, and SLC10A4.
Abstract
Background: Accurate survival prediction in breast cancer is essential for patient stratification and personalized therapy. Integrating gene expression data with clinical factors may enhance prognostic performance and support precision medicine. Objective: To develop an integrative survival prediction model combining clinical variables and gene expression signatures, and to assess their contributions using penalized Cox regression and machine learning. Methods: We analyzed 1,867 patients from the METABRIC cohort with clinical annotations and microarray-based gene expression profiles. The top 5,000 most variable genes were retained. Elastic Net-penalized Cox regression identified 75 predictors (70 genes and 5 clinical variables: tumor size, stage, surgery type, age at diagnosis, and Nottingham Prognostic Index). Model performance was evaluated with Harrell's concordance index (C-index)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
