TL;DR
This paper introduces a conservative partial identification method for estimating Covid-19 prevalence from serology tests with unknown parameters, providing valid finite-sample inference without relying on asymptotic assumptions.
Contribution
It develops a novel partial identification approach that handles unknown test parameters and offers finite-sample validity for disease prevalence estimation.
Findings
Serology test results suggest wide prevalence ranges, often inconclusive.
Prevalence estimates vary geographically, e.g., 0-2% in California, 13-17% in New York.
Combining datasets indicates a 5-8% prevalence, informing policy.
Abstract
We propose a partial identification method for estimating disease prevalence from serology studies. Our data are results from antibody tests in some population sample, where the test parameters, such as the true/false positive rates, are unknown. Our method scans the entire parameter space, and rejects parameter values using the joint data density as the test statistic. The proposed method is conservative for marginal inference, in general, but its key advantage over more standard approaches is that it is valid in finite samples even when the underlying model is not point identified. Moreover, our method requires only independence of serology test results, and does not rely on asymptotic arguments, normality assumptions, or other approximations. We use recent Covid-19 serology studies in the US, and show that the parameter confidence set is generally wide, and cannot support definite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
