Pilot Validation of AI-Derived Features for Prognostic Models in Geriatric Oncology
Alaa Albashayreh, W Nick Street, Stephanie Gilbertson-White

TL;DR
This study shows that adding AI-extracted information from clinical notes improves survival predictions for older lung cancer patients compared to using only standard data.
Contribution
The novel contribution is demonstrating that NLP-derived features from clinical notes significantly enhance prognostic model accuracy in geriatric oncology.
Findings
The enriched model with NLP features achieved higher AUC (0.768) and C-Statistic (0.717) than the baseline model.
Functional status and chemotherapy timing from clinical notes were key predictors in the improved model.
The enriched model effectively stratified patients into distinct prognostic groups with well-calibrated predictions.
Abstract
Accurate survival prediction for older adults with lung cancer is critical for guiding supportive care, yet standard models often underutilize valuable information in clinical notes. The objective of this study was to determine if a prognostic model integrating features from unstructured clinical notes could more accurately predict survival in older adults with lung cancer than a model using structured data alone. We studied 1,391 patients (≥60 years at time of diagnosis) with lung cancer, analyzing 19,377 clinical encounters. We compared a baseline survival model using structured electronic health record (EHR) data to an enriched model that integrated previously validated natural language processing (NLP)-derived features, including functional status. Models were evaluated for 12-month mortality using AUC and C-Statistic. Thirty-eight percent of patients presented with metastatic…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare and Education · Radiomics and Machine Learning in Medical Imaging
