High-dimensional regression with outcomes of mixed-type using the multivariate spike-and-slab LASSO
Soham Ghosh, Sameer K. Deshpande

TL;DR
This paper introduces a Bayesian high-dimensional multi-outcome regression model that estimates sparse covariate effects and outcome correlations for mixed binary and continuous data, with proven theoretical guarantees and practical applications.
Contribution
It develops a novel multivariate spike-and-slab LASSO approach with a Monte Carlo EM algorithm, providing theoretical posterior contraction rates and a sure screening property for mixed outcomes.
Findings
Accurate recovery of true covariate effects in simulations.
Effective estimation of outcome correlations in real data applications.
Strong finite-sample performance demonstrated across diverse scenarios.
Abstract
We consider a high-dimensional multi-outcome regression in which possibly dependent, binary and continuous outcomes are regressed onto covariates. We model the observed outcome vector as a partially observed latent realization from a multivariate linear regression model. Our goal is to estimate simultaneously a sparse matrix () of latent regression coefficients (i.e., partial covariate effects) and a sparse latent residual precision matrix (), which induces partial correlations between the observed outcomes. To this end, we specify continuous spike-and-slab priors on all entries of and off-diagonal elements of and introduce a Monte Carlo Expectation-Conditional Maximization algorithm to compute the maximum a posterior estimate of the model parameters. Under a set of mild assumptions, we derive the posterior contraction rate for our model in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Thermography in Medicine
