Empirical Evidence That There Is No Such Thing As A Validated Prediction Model
Florian D. van Leeuwen, Ewout W. Steyerberg, David van Klaveren, Ben, Wessler, David M. Kent, Erik W. van Zwet

TL;DR
This study demonstrates that due to significant heterogeneity, there is inherently large and often underestimated uncertainty in the predictive accuracy (AUC) of clinical prediction models across different settings, challenging the notion of a universally validated model.
Contribution
The paper introduces an empirical Bayesian method to better estimate the uncertainty in AUC predictions across external validations of clinical models, addressing limitations of traditional meta-analyses.
Findings
Large heterogeneity in AUCs across validations
Frequentist methods underestimate uncertainty
Bayesian approach improves coverage accuracy
Abstract
Background: External validations are essential to assess clinical prediction models (CPMs) before deployment. Apart from model misspecification, differences in patient population and other factors influence a model's AUC (c-statistic). We aimed to quantify variation in AUCs across external validation studies and adjust expectations of a model's performance in a new setting. Methods: The Tufts-PACE CPM Registry contains CPMs for cardiovascular disease prognosis. We analyzed the AUCs of 469 CPMs with a total of 1,603 external validations. For each CPM, we performed a random effects meta-analysis to estimate the between-study standard deviation among the AUCs. Since the majority of these meta-analyses has only a handful of validations, this leads to very poor estimates of . So, we estimated a log normal distribution of across all CPMs and used this as an empirical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Sepsis Diagnosis and Treatment · Artificial Intelligence in Healthcare
