On the Theory of Variance Reduction for Stochastic Gradient Monte Carlo
Niladri S. Chatterji, Nicolas Flammarion, Yi-An Ma, Peter L. Bartlett, and Michael I. Jordan

TL;DR
This paper establishes convergence guarantees in Wasserstein distance for various variance-reduction stochastic gradient Langevin methods under strong convexity and smoothness assumptions, providing theoretical bounds and empirical validation.
Contribution
It introduces a novel proof technique combining optimization and sampling analysis to derive convergence bounds for variance-reduction Langevin methods.
Findings
Variance-reduction methods achieve improved convergence rates.
Theoretical bounds identify regimes where each method excels.
Experimental results confirm the theoretical predictions.
Abstract
We provide convergence guarantees in Wasserstein distance for a variety of variance-reduction methods: SAGA Langevin diffusion, SVRG Langevin diffusion and control-variate underdamped Langevin diffusion. We analyze these methods under a uniform set of assumptions on the log-posterior distribution, assuming it to be smooth, strongly convex and Hessian Lipschitz. This is achieved by a new proof technique combining ideas from finite-sum optimization and the analysis of sampling methods. Our sharp theoretical bounds allow us to identify regimes of interest where each method performs better than the others. Our theory is verified with experiments on real-world and synthetic datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
MethodsSAGA
