Bayes Shrinkage at GWAS scale: Convergence and Approximation Theory of a Scalable MCMC Algorithm for the Horseshoe Prior
James E. Johndrow, Paulo Orenstein, Anirban Bhattacharya

TL;DR
This paper introduces two scalable MCMC algorithms for the horseshoe prior in high-dimensional Bayesian models, with theoretical guarantees and practical improvements demonstrated on large-scale GWAS data.
Contribution
The authors develop and analyze two new MCMC algorithms that improve scalability and accuracy for high-dimensional horseshoe prior models, including an approximation method with theoretical guarantees.
Findings
Algorithms outperform existing methods in speed and accuracy.
Application to GWAS data demonstrates scalability to 50,000 predictors.
New insights into posterior features like bimodality and variable uncertainty.
Abstract
The horseshoe prior is frequently employed in Bayesian analysis of high-dimensional models, and has been shown to achieve minimax optimal risk properties when the truth is sparse. While optimization-based algorithms for the extremely popular Lasso and elastic net procedures can scale to dimension in the hundreds of thousands, algorithms for the horseshoe that use Markov chain Monte Carlo (MCMC) for computation are limited to problems an order of magnitude smaller. This is due to high computational cost per step and growth of the variance of time-averaging estimators as a function of dimension. We propose two new MCMC algorithms for computation in these models that have improved performance compared to existing alternatives. One of the algorithms also approximates an expensive matrix product to give orders of magnitude speedup in high-dimensional applications. We prove that the exact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Markov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models
