Quantifying the mini-batching error in Bayesian inference for Adaptive Langevin dynamics
Inass Sekkat, Gabriel Stoltz

TL;DR
This paper analyzes the impact of mini-batching noise on Bayesian inference using Langevin dynamics and proposes Adaptive Langevin dynamics to correct for this bias, with extensions to improve accuracy.
Contribution
It quantifies the mini-batching bias in Bayesian Langevin methods and introduces Adaptive Langevin dynamics to mitigate this bias in practical scenarios.
Findings
Mini-batching introduces significant bias in Langevin-based Bayesian inference.
Adaptive Langevin dynamics can correct for mini-batching noise under certain assumptions.
Extensions of Adaptive Langevin can further reduce bias in complex models.
Abstract
Bayesian inference allows to obtain useful information on the parameters of models, either in computational statistics or more recently in the context of Bayesian Neural Networks. The computational cost of usual Monte Carlo methods for sampling posterior laws in Bayesian inference scales linearly with the number of data points. One option to reduce it to a fraction of this cost is to resort to mini-batching in conjunction with unadjusted discretizations of Langevin dynamics, in which case only a random fraction of the data is used to estimate the gradient. However, this leads to an additional noise in the dynamics and hence a bias on the invariant measure which is sampled by the Markov chain. We advocate using the so-called Adaptive Langevin dynamics, which is a modification of standard inertial Langevin dynamics with a dynamical friction which automatically corrects for the increased…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods · Target Tracking and Data Fusion in Sensor Networks
