Model-assisted estimation in high-dimensional settings for survey data
Mehdi Dagdoug, Camelia Goga, David Haziza

TL;DR
This paper evaluates various model-assisted estimators, including penalized and tree-based methods, in high-dimensional survey data, demonstrating their bias and efficiency through extensive simulations.
Contribution
It extends the analysis of model-assisted estimators to high-dimensional settings, comparing their performance using real-world smart metering data.
Findings
Penalized estimators show reduced bias in high-dimensional data.
Tree-based estimators improve efficiency over traditional methods.
Model-assisted estimators maintain consistency regardless of model correctness.
Abstract
Model-assisted estimators have attracted a lot of attention in the last three decades. These estimators attempt to make an efficient use of auxiliary information available at the estimation stage. A working model linking the survey variable to the auxiliary variables is specified and fitted on the sample data to obtain a set of predictions, which are then incorporated in the estimation procedures. A nice feature of model-assisted procedures is that they maintain important design properties such as consistency and asymptotic unbiasedness irrespective of whether or not the working model is correctly specified. In this article, we examine several model-assisted estimators from a design-based point of view and in a high-dimensional setting, including penalized estimators and tree-based estimators. We conduct an extensive simulation study using data from the Irish Commission for Energy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWater resources management and optimization · Water Systems and Optimization · Energy and Environment Impacts
