Estimation of finite population proportions for small areas -- a statistical data integration approach
Aditi Sen, Partha Lahiri

TL;DR
This paper introduces a novel data linkage and modeling approach for estimating small area population proportions, overcoming limitations of traditional empirical best prediction methods by leveraging large probability samples and advanced statistical techniques.
Contribution
It proposes a new data linkage method using a large probability sample, along with an adjusted maximum likelihood estimation and bootstrap error assessment, enhancing small area proportion estimation.
Findings
Effective in election projection for small areas
Improves variance estimation accuracy
Addresses computational challenges with EM algorithm
Abstract
Empirical best prediction (EBP) is a well-known method for producing reliable proportion estimates when the primary data source provides only small or no sample from finite populations. There are potential challenges in implementing existing EBP methodology such as limited auxiliary variables in the frame (not adequate for building a reasonable working predictive model) or unable to accurately link the sample to the finite population frame due to absence of identifiers. In this paper, we propose a new data linkage approach where the finite population frame is replaced by a big probability sample, having a large set of auxiliary variables but not the outcome binary variable of interest. We fit an assumed model on the small probability sample and then impute the outcome variable for all units of the big sample to obtain standard weighted proportions. We develop a new adjusted maximum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectoral Systems and Political Participation · Statistical Methods and Bayesian Inference · Census and Population Estimation
