The True Cost of Stochastic Gradient Langevin Dynamics

Tigran Nagapetyan; Andrew B. Duncan; Leonard Hasenclever; Sebastian J. Vollmer; Lukasz Szpruch; Konstantinos Zygalakis

arXiv:1706.02692 · stat.ME · June 9, 2017 · 32 cites

TL;DR

This paper analyzes the bias and computational cost of Stochastic Gradient Langevin Dynamics (SGLD) in Bayesian inference, showing how stepsize choices affect accuracy and proposing methods to reduce costs while maintaining credible interval coverage.

Contribution

It provides a theoretical analysis of SGLD bias and cost, demonstrating the impact of stepsize and batchsize, and introduces a control variate approach to improve efficiency.

Findings

01. Bias in SGLD depends on stepsize and batchsize.

02. Cost to achieve target accuracy is similar across batchsizes without control variates.

03. Control variates significantly reduce computational cost.
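
To make the third finding concrete, here is a minimal sketch of one common control variate construction for stochastic gradients, in which the minibatch estimate is centred on a full-data gradient computed once at a fixed reference point. This summary does not spell out the paper's exact construction, so the logistic regression model and the names `grad_terms`, `plain_estimate`, `cv_estimate`, `theta_ref`, and `full_grad_ref` are all assumptions made for illustration.

```python
import numpy as np


def grad_terms(theta, X, y):
    """Per-datum gradients of a logistic log-likelihood, shape (m, d).

    The logistic model is an illustrative assumption, not taken from the paper.
    """
    p = 1.0 / (1.0 + np.exp(-X @ theta))
    return (y - p)[:, None] * X


def plain_estimate(theta, X, y, idx):
    """Standard minibatch estimate of the full-data gradient at theta."""
    n = X.shape[0]
    return (n / idx.size) * grad_terms(theta, X[idx], y[idx]).sum(axis=0)


def cv_estimate(theta, theta_ref, full_grad_ref, X, y, idx):
    """Control variate estimate: full gradient at theta_ref plus a
    minibatch estimate of the gradient difference between theta and theta_ref.

    full_grad_ref must equal grad_terms(theta_ref, X, y).sum(axis=0),
    computed once over the whole data set.
    """
    n = X.shape[0]
    diff = grad_terms(theta, X[idx], y[idx]) - grad_terms(theta_ref, X[idx], y[idx])
    return full_grad_ref + (n / idx.size) * diff.sum(axis=0)
```

Both estimators are unbiased for the full-data gradient at `theta`. The control variate version pays one full pass over the data up front, and its variance shrinks as `theta` approaches `theta_ref`, which is why a reference point near the posterior mode is a typical choice.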

Abstract

The problem of posterior inference is central to Bayesian statistics and a wealth of Markov Chain Monte Carlo (MCMC) methods have been proposed to obtain asymptotically correct samples from the posterior. As datasets in applications grow larger and larger, scalability has emerged as a central problem for MCMC methods. Stochastic Gradient Langevin Dynamics (SGLD) and related stochastic gradient Markov Chain Monte Carlo methods offer scalability by using stochastic gradients in each step of the simulated dynamics. While these methods are asymptotically unbiased if the stepsizes are reduced in an appropriate fashion, in practice constant stepsizes are used. This introduces a bias that is often ignored. In this paper we study the mean squared error of Lipschitz functionals in strongly log-concave models with i.i.d. data of growing data set size and show that, given a batchsize, to control…
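
The constant-stepsize SGLD scheme described in the abstract can be sketched on a toy problem. The following is a minimal illustration, not the paper's experimental setup: it assumes a one-dimensional Gaussian mean model (observations x_i ~ N(theta, 1) with a N(0, 1) prior), which is strongly log-concave, and the function name `sgld_gaussian_mean` and its parameters are hypothetical.

```python
import numpy as np


def sgld_gaussian_mean(data, step, batch_size, n_steps, rng):
    """Constant-stepsize SGLD for the posterior of a Gaussian mean.

    Assumed toy model: x_i ~ N(theta, 1), prior theta ~ N(0, 1).
    Returns the chain of theta values, one per step.
    """
    n = data.shape[0]
    theta = 0.0
    samples = np.empty(n_steps)
    for k in range(n_steps):
        # Subsample a minibatch without replacement.
        batch = rng.choice(data, size=batch_size, replace=False)
        # Unbiased estimate of the gradient of the log posterior:
        # prior term plus rescaled minibatch likelihood term.
        grad = -theta + (n / batch_size) * np.sum(batch - theta)
        # Euler step of the Langevin dynamics with injected Gaussian noise.
        theta = theta + 0.5 * step * grad + np.sqrt(step) * rng.standard_normal()
        samples[k] = theta
    return samples
```

A call such as `sgld_gaussian_mean(data, step=1e-3, batch_size=10, n_steps=20000, rng=np.random.default_rng(0))` returns the chain. In this toy model with full gradients, the chain is an AR(1) process whose stationary variance exceeds the posterior variance 1/(n+1) by a factor 1/(1 - h(n+1)/4) for stepsize h, so the constant-stepsize bias appears in the spread of the samples rather than in their mean. Subsampling adds further variance on top of that.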

Taxonomy

Topics: Markov Chains and Monte Carlo Methods · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference