Calibration with Bagging of the Principal Components on a Large Number of Auxiliary Variables
Caren Hasler, Arnaud Tripet, Yves Till\'e

TL;DR
This paper introduces a novel calibration method using bagging and principal components to stabilize weights and variance when handling many auxiliary variables in survey sampling.
Contribution
The paper proposes a new calibration approach combining bagging and principal component analysis to improve stability and efficiency in survey weight adjustment with numerous auxiliary variables.
Findings
Variance remains controlled with many auxiliary variables.
Calibration weights are more stable and less dispersed.
Method enables exact calibration for key variables while maintaining low scatter.
Abstract
Calibration is a widely used method in survey sampling to adjust weights so that estimated totals of some chosen calibration variables match known population totals or totals obtained from other sources. When a large number of auxiliary variables are included as calibration variables, the variance of the total estimator can increase, and the calibration weights can become highly dispersed. To address these issues, we propose a solution inspired by bagging and principal component decomposition. With our approach, the principal components of the auxiliary variables are constructed. Several samples of calibration variables are selected without replacement and with unequal probabilities from among the principal components. For each sample, a system of weights is obtained. The final weights are the average weights of these different weighting systems. With our proposed method, it is possible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurvey Sampling and Estimation Techniques · Statistical Methods and Bayesian Inference · Survey Methodology and Nonresponse
