MCMC algorithms for Bayesian variable selection in the logistic regression model for large-scale genomic applications
Manuela Zucknick, Sylvia Richardson

TL;DR
This paper develops and compares MCMC algorithms that leverage dependence structures in Bayesian variable selection for large-scale genomic logistic regression, improving computational efficiency and convergence.
Contribution
It introduces dependence-aware MCMC samplers tailored for high-dimensional genomic data, enhancing scalability and performance over standard methods.
Findings
Dependence-aware samplers outperform standard Gibbs in mixing and convergence.
The proposed methods are effective in real gene expression data analysis.
Simulation studies confirm improved computational efficiency.
Abstract
In large-scale genomic applications vast numbers of molecular features are scanned in order to find a small number of candidates which are linked to a particular disease or phenotype. This is a variable selection problem in the "large p, small n" paradigm where many more variables than samples are available. Additionally, a complex dependence structure is often observed among the markers/genes due to their joint involvement in biological processes and pathways. Bayesian variable selection methods that introduce sparseness through additional priors on the model size are well suited to the problem. However, the model space is very large and standard Markov chain Monte Carlo (MCMC) algorithms such as a Gibbs sampler sweeping over all p variables in each iteration are often computationally infeasible. We propose to employ the dependence structure in the data to decide which variables should…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Gene expression and cancer classification
