GEMSS: A Variational Bayesian Method for Discovering Multiple Sparse Solutions in Classification and Regression Problems
Kate\v{r}ina Henclov\'a, V\'aclav \v{S}m\'idl

TL;DR
GEMSS is a Bayesian framework that discovers multiple diverse sparse feature sets simultaneously in classification and regression, providing comprehensive insights into possible explanations in high-dimensional, noisy, and imbalanced data settings.
Contribution
It introduces a novel variational Bayesian method with structured priors and diversity penalties to identify multiple solutions in a single optimization, unlike traditional sequential approaches.
Findings
Scales effectively to high-dimensional data with small sample sizes
Handles missing data, noise, and class imbalance robustly
Outperforms existing methods in benchmark experiments
Abstract
Selecting interpretable feature sets in underdetermined () and highly correlated regimes constitutes a fundamental challenge in data science, particularly when analyzing physical measurements. In such settings, multiple distinct sparse subsets may explain the response equally well. Identifying these alternatives is crucial for generating domain-specific insights into the underlying mechanisms, yet conventional methods typically isolate a single solution, obscuring the full spectrum of plausible explanations. We present GEMSS (Gaussian Ensemble for Multiple Sparse Solutions), a variational Bayesian framework specifically designed to simultaneously discover multiple, diverse sparse feature combinations. The method employs a structured spike-and-slab prior for sparsity, a mixture of Gaussians to approximate the intractable multimodal posterior, and a Jaccard-based penalty to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning in Materials Science · Explainable Artificial Intelligence (XAI)
