Bayesian variable selection in linear regression models with instrumental variables
Gautam Sabnis, Yves Atchad\'e, Prosper Dovonon

TL;DR
This paper introduces a Bayesian semi-parametric method for variable selection in high-dimensional linear regression models with endogenous regressors using instrumental variables, addressing endogeneity issues effectively.
Contribution
It develops a novel Bayesian approach combining quasi-likelihood and spike-and-slab priors for high-dimensional IV models, with theoretical guarantees and empirical validation.
Findings
Method performs well compared to alternatives
Theoretical conditions ensure posterior concentration
Empirical application revisits return on education
Abstract
Many papers on high-dimensional statistics have proposed methods for variable selection and inference in linear regression models by relying explicitly or implicitly on the assumption that all regressors are exogenous. However, applications abound where endogeneity arises from selection biases, omitted variables, measurement errors, unmeasured confounding and many other challenges common to data collection Fan et al. (2014). The most common cure to endogeneity issues consists in resorting to instrumental variable (IV) inference. The objective of this paper is to present a Bayesian approach to tackling endogeneity in high-dimensional linear IV models. Using a working quasi-likelihood combined with an appropriate sparsity inducing spike-and-slab prior distribution, we develop a semi-parametric method for variable selection in high-dimensional linear models with endogeneous regressors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Advanced Statistical Methods and Models
