A Symmetric Random Scan Collapsed Gibbs Sampler for Fully Bayesian Variable Selection with Spike-and-Slab Priors
Mengta Chung

TL;DR
This paper presents a scalable Bayesian variable selection method using a symmetric random scan Gibbs sampler that efficiently explores high-dimensional models with spike-and-slab priors, achieving high sensitivity and precision.
Contribution
The authors develop a novel symmetric random scan Gibbs sampler that reduces memory usage and improves sampling efficiency for Bayesian variable selection with spike-and-slab priors.
Findings
Achieves sensitivity of 1.0 and precision above 0.76 in simulations with 100,000 predictors.
Successfully identifies biologically validated genes in a genomic dataset.
Provides explicit guidance for tuning parameter selection based on correlation ratios.
Abstract
We introduce a symmetric random scan Gibbs sampler for scalable Bayesian variable selection that eliminates storage of the full cross-product matrix by computing required quantities on-the-fly. Data-informed proposal weights, constructed from marginal correlations, concentrate sampling effort on promising candidates while a uniform mixing component ensures theoretical validity. We provide explicit guidance for selecting tuning parameters based on the ratio of signal to null correlations, ensuring adequate posterior exploration. The posterior-mean-size selection rule provides an adaptive alternative to the median probability model that automatically calibrates to the effective signal density without requiring an arbitrary threshold. In simulations with one hundred thousand predictors, the method achieves sensitivity of 1.000 and precision above 0.76. Application to a genomic dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Gene expression and cancer classification · Markov Chains and Monte Carlo Methods
