(Non-) asymptotic properties of Stochastic Gradient Langevin Dynamics

Sebastian J. Vollmer; Konstantinos C. Zygalakis; and Yee Whye; Teh

arXiv:1501.00438·stat.ME·September 22, 2015·29 cites

(Non-) asymptotic properties of Stochastic Gradient Langevin Dynamics

Sebastian J. Vollmer, Konstantinos C. Zygalakis, and Yee Whye, Teh

PDF

Open Access

TL;DR

This paper analyzes the behavior of stochastic gradient Langevin dynamics (SGLD) with fixed step size, characterizing its bias and variance, and proposes modifications to improve its accuracy in large data set applications.

Contribution

It provides explicit bias characterization for fixed step size SGLD and introduces a modified version that reduces asymptotic bias due to stochastic gradient variance.

Findings

01

Explicit bias dependence on step size and gradient variance

02

Bounds on bias, variance, and mean squared error for finite iterations

03

Demonstration of SGLD's advantages over Euler method in large data regimes

Abstract

Applying standard Markov chain Monte Carlo (MCMC) algorithms to large data sets is computationally infeasible. The recently proposed stochastic gradient Langevin dynamics (SGLD) method circumvents this problem in three ways: it generates proposed moves using only a subset of the data, it skips the Metropolis-Hastings accept-reject step, and it uses sequences of decreasing step sizes. In \cite{TehThierryVollmerSGLD2014}, we provided the mathematical foundations for the decreasing step size SGLD, including consistency and a central limit theorem. However, in practice the SGLD is run for a relatively small number of iterations, and its step size is not decreased to zero. The present article investigates the behaviour of the SGLD with fixed step size. In particular we characterise the asymptotic bias explicitly, along with its dependence on the step size and the variance of the stochastic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference · Advanced Neuroimaging Techniques and Applications