On the Theory of Variance Reduction for Stochastic Gradient Monte Carlo

Niladri S. Chatterji, Nicolas Flammarion, Yi-An Ma, Peter L. Bartlett, and Michael I. Jordan

arXiv:1802.05431·stat.ML·February 16, 2018·29 cites

Open Access

TL;DR

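This paper establishes convergence guarantees in Wasserstein distance for three variance-reduced stochastic gradient Langevin methods (SAGA Langevin diffusion, SVRG Langevin diffusion, and control-variate underdamped Langevin diffusion), assuming a smooth, strongly convex, and Hessian-Lipschitz log-posterior. The bounds are checked with experiments on real-world and synthetic datasets.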

Contribution

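The paper introduces a new proof technique that combines ideas from finite-sum optimization with the analysis of sampling methods, and uses it to derive Wasserstein convergence bounds for variance-reduced Langevin methods under a single, uniform set of assumptions.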

Findings

01. Variance-reduction methods achieve improved convergence rates.

02. Theoretical bounds identify regimes where each method excels.

03. Experimental results confirm the theoretical predictions.

Abstract

We provide convergence guarantees in Wasserstein distance for a variety of variance-reduction methods: SAGA Langevin diffusion, SVRG Langevin diffusion and control-variate underdamped Langevin diffusion. We analyze these methods under a uniform set of assumptions on the log-posterior distribution, assuming it to be smooth, strongly convex and Hessian Lipschitz. This is achieved by a new proof technique combining ideas from finite-sum optimization and the analysis of sampling methods. Our sharp theoretical bounds allow us to identify regimes of interest where each method performs better than the others. Our theory is verified with experiments on real-world and synthetic datasets.
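To make the kind of method analyzed here concrete, the following is a minimal sketch of an SVRG-style Langevin sampler. It is not the authors' exact algorithm or experimental setup. The toy potential U(x) = sum_i 0.5 * (a_i . x - b_i)^2 (a flat-prior linear regression posterior, strongly convex when A has full column rank), the function names, and all parameters are assumptions made for illustration.

```python
import numpy as np


def grad_terms(x, A, b, idx):
    """Sum over idx of the gradients of f_i(x) = 0.5 * (a_i . x - b_i)^2."""
    residual = A[idx] @ x - b[idx]
    return A[idx].T @ residual


def svrg_gradient(x, snapshot, full_grad_snapshot, A, b, idx):
    """Unbiased, variance-reduced estimate of the full gradient at x.

    Uses the full gradient stored at the snapshot plus a rescaled
    minibatch correction (illustrative SVRG-style estimator).
    """
    n_data = A.shape[0]
    scale = n_data / len(idx)
    correction = grad_terms(x, A, b, idx) - grad_terms(snapshot, A, b, idx)
    return full_grad_snapshot + scale * correction


def svrg_langevin(A, b, step, n_epochs, epoch_len, batch_size, rng):
    """Run an SVRG-style Langevin chain and return all iterates."""
    n_data, dim = A.shape
    all_idx = np.arange(n_data)
    x = np.zeros(dim)
    samples = []
    for _ in range(n_epochs):
        # Refresh the snapshot and its full gradient once per epoch.
        snapshot = x.copy()
        full_grad_snapshot = grad_terms(snapshot, A, b, all_idx)
        for _ in range(epoch_len):
            # Minibatch drawn uniformly without replacement.
            idx = rng.choice(n_data, size=batch_size, replace=False)
            g = svrg_gradient(x, snapshot, full_grad_snapshot, A, b, idx)
            # Euler-Maruyama step of overdamped Langevin dynamics.
            noise = rng.standard_normal(dim)
            x = x - step * g + np.sqrt(2.0 * step) * noise
            samples.append(x.copy())
    return np.array(samples)
```

A call such as `svrg_langevin(A, b, step=1e-3, n_epochs=20, epoch_len=50, batch_size=10, rng=np.random.default_rng(0))` returns an array with one row per iterate. The step size has to be small relative to the largest eigenvalue of `A.T @ A` for the chain to stay stable. A SAGA-style variant would replace the periodic snapshot with a table of the most recently evaluated per-datum gradients.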

Taxonomy

Topics: Markov Chains and Monte Carlo Methods · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference

Methods: SAGA