Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain
Alexandre Belloni, Daniel Chen, Victor Chernozhukov, Christian Hansen

TL;DR
This paper introduces Lasso and Post-Lasso methods for estimating optimal instruments in high-dimensional linear IV models, allowing for many instruments and imperfect model selection, with applications to eminent domain effects.
Contribution
It develops new theoretical results for Lasso-based IV estimation in high-dimensional settings, including efficiency and robustness to model selection errors.
Findings
Lasso-based IV estimator is root-n consistent and asymptotically normal under approximate sparsity.
The estimator achieves semi-parametric efficiency with homoscedastic errors.
Simulation and empirical results show the proposed method outperforms existing approaches.
Abstract
We develop results for the use of Lasso and Post-Lasso methods to form first-stage predictions and estimate optimal instruments in linear instrumental variables (IV) models with many instruments, . Our results apply even when is much larger than the sample size, . We show that the IV estimator based on using Lasso or Post-Lasso in the first stage is root-n consistent and asymptotically normal when the first-stage is approximately sparse; i.e. when the conditional expectation of the endogenous variables given the instruments can be well-approximated by a relatively small set of variables whose identities may be unknown. We also show the estimator is semi-parametrically efficient when the structural error is homoscedastic. Notably our results allow for imperfect model selection, and do not rely upon the unrealistic "beta-min" conditions that are widely used to establish validity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
