Analysis of a capture-recapture estimator for the size of populations with heterogenous catchability, and its evaluation on RDS data from rural Uganda
Yakir Berchenko, Richard G. White, Cyprian Wejnert, Simon D.W. Frost

TL;DR
This paper introduces a new estimator for population size in capture-recapture studies with heterogeneous catchability, specifically tailored for Respondent Driven Sampling (RDS), and demonstrates its reduced bias in empirical data from rural Uganda.
Contribution
A novel generalized Horvitz-Thompson estimator is proposed for populations with variable catchability, improving accuracy over existing methods in RDS contexts.
Findings
The new estimator is less biased than the naive Lincoln-Petersen estimator.
Evaluation on Ugandan RDS data shows improved estimation accuracy.
Theoretical properties of the estimator are thoroughly analyzed.
Abstract
In this paper, we consider capture-recapture experiments with heterogenous catchability. In the setting we consider, the widespread Huggins-Alho estimator is not very suitable and we introduce and study a new generalized Horvitz-Thompson estimator. Our motivation is Respondent Driven Sampling (RDS), a prime example for such a setting where the capture probability is dependent on both the unknown population size as well as on an observable covariate, the network degree of an individual, due to peer recruitment. After discussing the theoretical properties of the new estimator, with full details given in the appendix, we evaluate it on various empirical and simulated data-sets, focusing on an RDS survey in a population in rural Uganda in which the population size is known a priori. The results thus obtained demonstrate that the adjusted estimator is less biased than the naive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCensus and Population Estimation · HIV, Drug Use, Sexual Risk · Data-Driven Disease Surveillance
