Population Size Estimation with Many Lists and Heterogeneity: A Conditional Log-Linear Model Among the Unobserved
Mateo Dulce Rubio, Edward Kennedy

TL;DR
This paper introduces a flexible, interpretable framework for estimating population size with multiple capture-recapture lists, accommodating heterogeneity and dependence among lists, and employing machine learning for nonparametric estimation.
Contribution
It develops a novel identification strategy that generalizes existing models, along with doubly-robust estimators and sensitivity analysis, applicable to complex heterogeneous capture probabilities.
Findings
Method performs well on synthetic data.
Application to Peruvian conflict data estimates casualties.
Addresses critiques by relaxing assumptions and incorporating covariates.
Abstract
We contribute a general and flexible framework to estimate the size of a closed population in the presence of capture-recapture lists and heterogeneous capture probabilities. Our novel identifying strategy leverages the fact that it is sufficient for identification that a subset of the lists are not arbitrarily dependent \textit{within the subset of the population unobserved by the remaining lists}, conditional on covariates. This identification approach is interpretable and actionable, interpolating between the two predominant approaches in the literature as special cases: (conditional) independence across lists and log-linear models with no highest-order interaction. We derive nonparametric doubly-robust estimators for the resulting identification expression that are nearly optimal and approximately normal for any finite sample size, even when the heterogeneous capture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCensus and Population Estimation · Survey Sampling and Estimation Techniques
