Evaluating discriminatory accuracy of models using partial risk-scores in two-phase studies
Parichoy Pal Choudhury, Anil K. Chaturvedi, Nilanjan Chatterjee

TL;DR
This paper introduces an efficient method to evaluate the discriminatory accuracy of risk prediction models in two-phase studies using partial risk-scores, enabling validation with incomplete covariate data.
Contribution
The authors develop a non-parametric approach leveraging partial risk-scores for model evaluation, along with an influence function based variance estimation, applicable in complex two-phase study designs.
Findings
Method performs well in finite samples
Outperforms inverse probability weighted estimators in simulations
Successfully applied to lung cancer risk model data
Abstract
Prior to clinical applications, it is critical that risk prediction models are evaluated in independent studies that did not contribute to model development. While prospective cohort studies provide a natural setting for model validation, they often ascertain information on some risk factors (e.g., an expensive biomarker) in a nested sub-study of the original cohort, typically selected based on case-control status, and possibly some additional covariates. In this article, we propose an efficient approach for evaluating discriminatory ability of models using data from all individuals in a cohort study irrespective of whether they were sampled in the nested sub-study for measuring the complete set of risk factors. For evaluation of the Area Under the Curve (AUC) statistics, we estimate probabilities of risk-scores for cases being larger than those in controls conditional on partial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Causal Inference Techniques · Statistical Methods in Clinical Trials
