One-step Estimation of Networked Population Size: Respondent-Driven Capture-Recapture with Anonymity
Bilal Khan, Hsuan-Wei Lee, Ian Fellows, Kirk Dombrowski

TL;DR
This paper introduces three novel one-step network-based estimators for hidden population size, applicable in anonymous respondent-driven sampling, validated through simulations and real-world data, aiding health surveillance efforts.
Contribution
It presents new estimators for hidden population sizes that work with anonymity and clustering, extending existing survey methodologies with proven consistency conditions.
Findings
Estimators perform well on synthetic networks with clustering.
Effective in estimating populations of 5,000-40,000 from samples of 250-750.
Anonymity schemes maintain estimator accuracy with tradeoffs.
Abstract
Population size estimates for hidden and hard-to-reach populations are particularly important when members are known to suffer from disproportion health issues or to pose health risks to the larger ambient population in which they are embedded. Efforts to derive size estimates are often frustrated by a range of factors that preclude conventional survey strategies, including social stigma associated with group membership or members' involvement in illegal activities. This paper extends prior research on the problem of network population size estimation, building on established survey/sampling methodologies commonly used with hard-to-reach groups. Three novel one-step, network-based population size estimators are presented, to be used in the context of uniform random sampling, respondent-driven sampling, and when networks exhibit significant clustering effects. Provably sufficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
