Sparse Linear Models and Two-Stage Estimation in High-Dimensional Settings with Possibly Many Endogenous Regressors
Ying Zhu

TL;DR
This paper analyzes the theoretical properties of two-stage Lasso-based estimation methods for high-dimensional sparse linear models with endogenous regressors, providing bounds and conditions for consistency.
Contribution
It extends two-stage estimation theory to high-dimensional settings with endogenous regressors, establishing non-asymptotic bounds and verifying key conditions for consistency.
Findings
Non-asymptotic bounds for estimation error and variable selection.
Conditions under which the two-stage estimator achieves consistency.
Simulation results illustrating finite sample performance.
Abstract
This paper explores the validity of the two-stage estimation procedure for sparse linear models in high-dimensional settings with possibly many endogenous regressors. In particular, the number of endogenous regressors in the main equation and the instruments in the first-stage equations can grow with and exceed the sample size n. The analysis concerns the exact sparsity case, i.e., the maximum number of non-zero components in the vectors of parameters in the first-stage equations, k1, and the number of non-zero components in the vector of parameters in the second-stage equation, k2, are allowed to grow with n but slowly compared to n. I consider the high-dimensional version of the two-stage least square estimator where one obtains the fitted regressors from the first-stage regression by a least square estimator with l_1-regularization (the Lasso or Dantzig selector) when the first-stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Sparse and Compressive Sensing Techniques · Distributed Sensor Networks and Detection Algorithms
