TL;DR
This paper introduces a convex optimization approach to Bayesian variable selection, demonstrating that the mode-finding problem is more tractable than previously thought, with a simple algorithm converging to the global optimum.
Contribution
It reformulates the marginal likelihood optimization as a difference of convex functions and provides a convergent algorithm, offering an efficient alternative to MCMC methods.
Findings
The DC algorithm converges to the global optimum at a linear rate.
The approach extends to maximum marginal posterior with suitable priors.
The method is easy to implement, tuning-free, and applicable to structured sparsity.
Abstract
Variable selection in linear regression has been a central topic in statistical research for decades. Bayesian variable selection methods, which account for uncertainty in both the regression coefficients and the noise variance, have achieved broad success through the use of discrete or continuous shrinkage priors and efficient collapsed Gibbs samplers. Despite their popularity and strong empirical performance, an enigma remains: the marginal likelihood, obtained by integrating out the regression coefficients and noise variance, is not log-concave; therefore, there is no guarantee of reliably finding its global optimum. In this article, we study this problem from an optimization perspective. Taking the negative log-marginal likelihood as a loss function of the latent precision parameters, we can rewrite it as a difference of convex functions (DC), and then optimize it via a simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
