A Complete Recipe for Stochastic Gradient MCMC
Yi-An Ma, Tianqi Chen, Emily B. Fox

TL;DR
This paper introduces a comprehensive framework for constructing Markov chain Monte Carlo (MCMC) samplers using continuous dynamics, enabling scalable stochastic gradient methods with provable convergence and new sampler designs.
Contribution
The paper provides a complete recipe for building MCMC samplers based on continuous Markov processes, unifying existing methods and enabling new scalable algorithms like SGRHMC.
Findings
Framework can represent any continuous Markov process for sampling
Previous samplers can be derived within this unified framework
Proposed SGRHMC combines Riemann HMC benefits with stochastic scalability
Abstract
Many recent Markov chain Monte Carlo (MCMC) samplers leverage continuous dynamics to define a transition kernel that efficiently explores a target distribution. In tandem, a focus has been on devising scalable variants that subsample the data and use stochastic gradients in place of full-data gradients in the dynamic simulations. However, such stochastic gradient MCMC samplers have lagged behind their full-data counterparts in terms of the complexity of dynamics considered since proving convergence in the presence of the stochastic gradient noise is non-trivial. Even with simple dynamics, significant physical intuition is often required to modify the dynamical system to account for the stochastic gradient noise. In this paper, we provide a general recipe for constructing MCMC samplers--including stochastic gradient versions--based on continuous Markov processes specified via two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Ion-surface interactions and analysis · Mass Spectrometry Techniques and Applications
