Challenges of the inconsistency regime: Novel debiasing methods for missing data models
Michael Celentano, Martin J. Wainwright

TL;DR
This paper addresses the challenge of estimating population means with missing data in high-dimensional settings where traditional methods fail, proposing a novel debiasing approach that remains consistent when the number of confounders exceeds the sample size.
Contribution
It introduces a new debiasing method for semi-parametric estimation in the high-dimensional, inconsistent regime where classical estimators are unreliable.
Findings
The proposed method is consistent for the population mean under proportional asymptotics.
It provides confidence intervals for linear model coefficients in the n < p setting.
Classical debiasing procedures become inconsistent in the n < p regime.
Abstract
We study semi-parametric estimation of the population mean when data is observed missing at random (MAR) in the "inconsistency regime", in which neither the outcome model nor the propensity/missingness model can be estimated consistently. Consider a high-dimensional linear-GLM specification in which the number of confounders is proportional to the sample size. In the case , past work has developed theory for the classical AIPW estimator in this model and established its variance inflation and asymptotic normality when the outcome model is fit by ordinary least squares. Ordinary least squares is no longer feasible in the case studied here, and we also demonstrate that a number of classical debiasing procedures become inconsistent. This challenge motivates our development and analysis of a novel procedure: we establish that it is consistent for the population mean…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
