Predictive case control designs for modification learning
W Katherine Tan, Patrick J Heagerty

TL;DR
This paper introduces predictive case control (PCC) sampling designs to efficiently gather outcome labels for model modification learning, reducing sample size and cost compared to simple random sampling.
Contribution
It proposes a novel PCC sampling method based on model scores, with mathematical justification and a computational framework for design evaluation.
Findings
PCC sampling achieves effective model modification with smaller samples.
Simulation shows PCC outperforms simple random sampling in information efficiency.
Application to radiology data demonstrates practical utility of PCC in clinical settings.
Abstract
Prediction models for clinical outcomes may be developed using a source dataset and additionally applied to new settings. Towards model external validation and model updating in the new setting, one procedure is model modification learning that involves the dual goals of recalibrating overall predictions as well as revising individual feature effects. Modification learning generally requires the collection of an adequate sample of true outcome labels from the new setting, which is frequently an expensive and time-consuming process, as it involves abstraction by human clinical experts. To reduce the abstraction burden for such new data collection, we propose a class of designs based on original model scores and their associated outcome predictions. We provide mathematical justification that the general predictive score sampling class results in valid samples for analysis. Then, we focus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Imbalanced Data Classification Techniques
