Accounting for Mismatch Error in Small Area Estimation with Linked Data
Enrico Fabrizi, Nicola Salvati, Martin Slawski

TL;DR
This paper develops methods to adjust small area estimators for linkage errors in linked data, improving the accuracy of estimates in small area estimation models.
Contribution
It introduces linkage error adjustments for linear mixed effects and M-quantile regression models using a mixture model and EM algorithm, enhancing small area estimation accuracy.
Findings
Adjusted predictors outperform unadjusted ones in simulations.
Proposed methods effectively account for linkage errors.
Mean squared error approximations are provided for practical use.
Abstract
In small area estimation different data sources are integrated in order to produce reliable estimates of target parameters (e.g., a mean or a proportion) for a collection of small subsets (areas) of a finite population. Regression models such as the linear mixed effects model or M-quantile regression are often used to improve the precision of survey sample estimates by leveraging auxiliary information for which means or totals are known at the area level. In many applications, the unit-level linkage of records from different sources is probabilistic and potentially error-prone. In this paper, we present adjustments of the small area predictors that are based on either the linear mixed effects model or M-quantile regression to account for the presence of linkage error. These adjustments are developed from a two-component mixture model that hinges on the assumption of independence of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsdemographic modeling and climate adaptation · Statistical Methods and Bayesian Inference
