Evaluating discriminatory accuracy of models using partial risk-scores   in two-phase studies

Parichoy Pal Choudhury; Anil K. Chaturvedi; Nilanjan Chatterjee

arXiv:1710.04379·stat.ME·October 13, 2017·2 cites

Evaluating discriminatory accuracy of models using partial risk-scores in two-phase studies

Parichoy Pal Choudhury, Anil K. Chaturvedi, Nilanjan Chatterjee

PDF

Open Access

TL;DR

This paper introduces an efficient method to evaluate the discriminatory accuracy of risk prediction models in two-phase studies using partial risk-scores, enabling validation with incomplete covariate data.

Contribution

The authors develop a non-parametric approach leveraging partial risk-scores for model evaluation, along with an influence function based variance estimation, applicable in complex two-phase study designs.

Findings

01

Method performs well in finite samples

02

Outperforms inverse probability weighted estimators in simulations

03

Successfully applied to lung cancer risk model data

Abstract

Prior to clinical applications, it is critical that risk prediction models are evaluated in independent studies that did not contribute to model development. While prospective cohort studies provide a natural setting for model validation, they often ascertain information on some risk factors (e.g., an expensive biomarker) in a nested sub-study of the original cohort, typically selected based on case-control status, and possibly some additional covariates. In this article, we propose an efficient approach for evaluating discriminatory ability of models using data from all individuals in a cohort study irrespective of whether they were sampled in the nested sub-study for measuring the complete set of risk factors. For evaluation of the Area Under the Curve (AUC) statistics, we estimate probabilities of risk-scores for cases being larger than those in controls conditional on partial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Causal Inference Techniques · Statistical Methods in Clinical Trials