Real world deployment of a pancreatic cancer risk model: impact of refitting, imputation, and computational burden
Wansu Chen, Botao Zhou, Tiffany Q. Luong, Fagen Xie, Bechien U. Wu

TL;DR
This paper evaluates how to effectively deploy a pancreatic cancer risk model in real-world clinical settings, focusing on model refitting and data imputation strategies.
Contribution
The study provides practical guidance on deploying predictive models in clinical settings by comparing model refitting and imputation methods.
Findings
Refitting the model improved discrimination and calibration compared to using the original model.
IFCE imputation offered the best balance between performance and computational efficiency.
Model performance varied across racial and ethnic groups, with poorest calibration among Black patients.
Abstract
Early detection is a major clinical challenge in pancreatic cancer due to its nonspecific symptoms and frequent late-stage diagnosis. While predictive models using electronic health record (EHR) data show promise, their real world implementation remains underexplored. We previously developed a random survival forest (RSF) model to estimate pancreatic cancer risk using structured EHR data from 2007 to 2017. This study evaluates practical considerations for deploying such a model in a prospective clinical context. We refit the original RSF model using a cohort from 2018 to 2019 and evaluated its performance on a 2020 cohort. We assessed how model refitting and different imputation strategies influenced predictive performance and compared execution times to evaluate computational feasibility. Three imputation strategies were tested: sub-model estimation (SME), stacked multiple imputation…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPancreatic and Hepatic Oncology Research · Pancreatitis Pathology and Treatment · Cancer Genomics and Diagnostics
