Regularization Methods for High-Dimensional Instrumental Variables Regression With an Application to Genetical Genomics
Wei Lin, Rui Feng, Hongzhe Li

TL;DR
This paper introduces a two-stage regularization approach for high-dimensional instrumental variables regression, effectively handling large numbers of covariates and instruments in genetical genomics studies.
Contribution
It extends classical two-stage least squares to high dimensions using sparsity-inducing penalties, with theoretical guarantees and practical implementation.
Findings
Method performs well in simulations.
Effective in selecting relevant variables.
Applied successfully to mouse obesity data.
Abstract
In genetical genomics studies, it is important to jointly analyze gene expression data and genetic variants in exploring their associations with complex traits, where the dimensionality of gene expressions and genetic variants can both be much larger than the sample size. Motivated by such modern applications, we consider the problem of variable selection and estimation in high-dimensional sparse instrumental variables models. To overcome the difficulty of high dimensionality and unknown optimal instruments, we propose a two-stage regularization framework for identifying and estimating important covariate effects while selecting and estimating optimal instruments. The methodology extends the classical two-stage least squares estimator to high dimensions by exploiting sparsity using sparsity-inducing penalty functions in both stages. The resulting procedure is efficiently implemented by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
