Bayesian MI-LASSO for Variable Selection on Multiply-Imputed Data
Jungang Zou, Sijian Wang, Qixuan Chen

TL;DR
This paper introduces Bayesian MI-LASSO models for consistent variable selection on multiply-imputed datasets, improving accuracy and robustness over traditional methods through Bayesian shrinkage and Spike-Slab priors.
Contribution
It extends MI-LASSO to a Bayesian framework with four novel models and a projection predictive procedure for better variable selection and inference.
Findings
Bayesian MI-LASSO outperforms traditional MI-LASSO in simulations.
Proposed methods achieve higher specificity and lower MSE.
Application to real data demonstrates practical utility.
Abstract
Multiple imputation is widely used for handling missing data in real-world applications. For variable selection on multiply-imputed datasets, however, if selection is performed on each imputed dataset separately, it can result in different sets of selected variables across datasets. MI-LASSO, one of the most commonly used approaches to this problem, regards the same variable across all separate imputed datasets as a group variable and exploits the group LASSO to yield a consistent variable selection across all the multiply-imputed datasets. In this paper, we extend MI-LASSO to a Bayesian framework and propose four Bayesian MI-LASSO models for variable selection on multiply-imputed data, including three shrinkage prior-based and one Spike-Slab prior-based methods. To further support robust variable selection, we develop a four-step projection predictive variable selection procedure that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Bayesian Methods and Mixture Models
