Robustifying and Selecting Cohort-Appropriate Prognostic Models under Distributional Shifts
Dimitris Bertsimas, Carol Gao, Angelos G. Koulouras, Georgios Antonios Margonis

TL;DR
This study investigates how distributional differences affect prognostic model calibration across cohorts and proposes strategies to enhance model transportability and selection for clinical utility.
Contribution
It introduces methods to improve prognostic model calibration and selection across cohorts with distributional mismatches, validated on real-world surgical data.
Findings
Calibration worsens with increased distributional mismatch.
Meta-analysis-informed weighting improves calibration.
Models from similar cohorts have better calibration and utility.
Abstract
External validation is widely regarded as the gold standard for prognostic model evaluation. In this study, we challenge the assumption that successful external calibration guarantees model generalizability and propose two complementary strategies to improve transportability of prognostic models across cohorts. Using six real-world surgical cohorts from tertiary academic centers, we tested whether successful external calibration depends largely on similarity in covariates and outcomes between training and validation cohorts, quantified using Kullback-Leibler (KL) divergence, with calibration assessed by the Integrated Calibration Index (ICI). From the model-developer's perspective, we trained the "best-on-average" prognostic model by tuning toward a meta-analysis-derived covariate and outcome distribution as an approximation of the broader target population. From the end-user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
