Prevalence Estimation from Random Samples and Census Data with Participation Bias
St\'ephane Guerrier, Christoph Kuzmics, Maria-Pia Victoria-Feser

TL;DR
This paper develops improved prevalence estimators for COVID-19 using survey data and official counts, accounting for participation bias and test accuracy, leading to more precise estimates with smaller samples.
Contribution
It introduces novel maximum likelihood and moment-based estimators that incorporate official data and measurement errors, enhancing prevalence estimation accuracy.
Findings
Estimators outperform simple sample proportions in accuracy.
Smaller survey samples achieve desired precision with new estimators.
Open source R package 'cape' implements the methods.
Abstract
Countries officially record the number of COVID-19 cases based on medical tests of a subset of the population with unknown participation bias. For prevalence estimation, the official information is typically discarded and, instead, small random survey samples are taken. We derive (maximum likelihood and method of moment) prevalence estimators, based on a survey sample, that additionally utilize the official information, and that are substantially more accurate than the simple sample proportion of positive cases. Put differently, using our estimators, the same level of precision can be obtained with substantially smaller survey samples. We take into account the possibility of measurement errors due to the sensitivity and specificity of the medical testing procedure. The proposed estimators and associated confidence intervals are implemented in the companion open source R package cape.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 epidemiological studies · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
