GEMSS: A Variational Bayesian Method for Discovering Multiple Sparse Solutions in Classification and Regression Problems

Kate\v{r}ina Henclov\'a; V\'aclav \v{S}m\'idl

arXiv:2602.08913·cs.LG·February 10, 2026

GEMSS: A Variational Bayesian Method for Discovering Multiple Sparse Solutions in Classification and Regression Problems

Kate\v{r}ina Henclov\'a, V\'aclav \v{S}m\'idl

PDF

Open Access

TL;DR

GEMSS is a Bayesian framework that discovers multiple diverse sparse feature sets simultaneously in classification and regression, providing comprehensive insights into possible explanations in high-dimensional, noisy, and imbalanced data settings.

Contribution

It introduces a novel variational Bayesian method with structured priors and diversity penalties to identify multiple solutions in a single optimization, unlike traditional sequential approaches.

Findings

01

Scales effectively to high-dimensional data with small sample sizes

02

Handles missing data, noise, and class imbalance robustly

03

Outperforms existing methods in benchmark experiments

Abstract

Selecting interpretable feature sets in underdetermined ( $n ≪ p$ ) and highly correlated regimes constitutes a fundamental challenge in data science, particularly when analyzing physical measurements. In such settings, multiple distinct sparse subsets may explain the response equally well. Identifying these alternatives is crucial for generating domain-specific insights into the underlying mechanisms, yet conventional methods typically isolate a single solution, obscuring the full spectrum of plausible explanations. We present GEMSS (Gaussian Ensemble for Multiple Sparse Solutions), a variational Bayesian framework specifically designed to simultaneously discover multiple, diverse sparse feature combinations. The method employs a structured spike-and-slab prior for sparsity, a mixture of Gaussians to approximate the intractable multimodal posterior, and a Jaccard-based penalty to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Machine Learning in Materials Science · Explainable Artificial Intelligence (XAI)