Augmented balancing weights as linear regression
David Bruns-Smith, Oliver Dukes, Avi Feller, and Elizabeth L. Ogburn

TL;DR
This paper characterizes augmented balancing weights in linear models, revealing their equivalence to single linear regressions and analyzing their properties under various regularization techniques, thus clarifying their theoretical behavior.
Contribution
It provides a novel linear regression-based characterization of augmented balancing weights, connecting them to existing models and analyzing their asymptotic properties.
Findings
Augmented estimator often reduces to OLS under certain regularizations.
Kernel ridge regression augmented estimator is equivalent to undersmoothed ridge regression.
Lasso-penalized models exhibit a double selection property.
Abstract
We provide a novel characterization of augmented balancing weights, also known as automatic debiased machine learning (AutoDML). These popular doubly robust or de-biased machine learning estimators combine outcome modeling with balancing weights - weights that achieve covariate balance directly in lieu of estimating and inverting the propensity score. When the outcome and weighting models are both linear in some (possibly infinite) basis, we show that the augmented estimator is equivalent to a single linear model with coefficients that combine the coefficients from the original outcome model and coefficients from an unpenalized ordinary least squares (OLS) fit on the same data. We see that, under certain choices of regularization parameters, the augmented estimator often collapses to the OLS estimator alone; this occurs for example in a re-analysis of the Lalonde 1986 dataset. We then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Machine Learning and Data Classification
