Design-Based Inference for the AUC with Complex Survey Data
Amaia Iparragirre, Thomas Lumley, Irantzu Barrio

TL;DR
This paper introduces a design-based inference framework for the AUC in complex survey data, utilizing replicate weights to improve accuracy over traditional bootstrap methods, with validation through simulations and NHANES data.
Contribution
It develops a novel design-based approach for AUC inference in complex surveys, implemented in the svyROC R package, addressing limitations of existing bootstrap methods.
Findings
Design-based methods achieve near-nominal coverage probabilities.
Traditional bootstrap underestimates variance, causing undercoverage.
Performance improves with more clusters per stratum.
Abstract
Complex survey data are usually collected following complex sampling designs. Accounting for the sampling design is essential to obtain unbiased estimates and valid inferences when analyzing complex survey data. The area under the receiver operating characteristic curve (AUC) is routinely used to assess the discriminative ability of predictive models for binary outcomes. However, valid inference for the AUC under complex sampling designs remains challenging. Although bootstrap techniques are widely applied under simple random sampling for variance estimation in this framework, traditional implementations do not account for complex designs. In this work, we propose a design-based framework for AUC inference. In particular, replicate weights methods are used to construct confidence intervals and hypothesis tests. The performance of replicate weights methods and the traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
