Data Integration through outcome adaptive LASSO and a collaborative propensity score approach
Asma Bahamyirou, Mireille E. Schnitzer

TL;DR
This paper introduces two adaptive variable selection methods using outcome adaptive LASSO and collaborative propensity scores to improve population mean estimation from administrative data, especially in high-dimensional settings.
Contribution
It develops novel data adaptive techniques for variable selection in doubly robust estimation, addressing high-dimensional covariates in administrative data analysis.
Findings
Simulation studies show improved accuracy over existing methods.
Methods demonstrate robustness in high-dimensional covariate scenarios.
Application highlights Covid-19's impact on Canadians.
Abstract
Administrative data, or non-probability sample data, are increasingly being used to obtain official statistics due to their many benefits over survey methods. In particular, they are less costly, provide a larger sample size, and are not reliant on the response rate. However, it is difficult to obtain an unbiased estimate of the population mean from such data due to the absence of design weights. Several estimation approaches have been proposed recently using an auxiliary probability sample which provides representative covariate information of the target population. However, when this covariate information is high-dimensional, variable selection is not a straight-forward task even for a subject matter expert. In the context of efficient and doubly robust estimation approaches for estimating a population mean, we develop two data adaptive methods for variable selection using the outcome…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurvey Sampling and Estimation Techniques · Statistical Methods and Bayesian Inference · Economic and Environmental Valuation
