Random Reshuffling for Stochastic Gradient Langevin Dynamics
Luke Shaw, Peter A. Whalley

TL;DR
This paper investigates Random Reshuffling as an alternative to Robbins-Monro in Stochastic Gradient Langevin Dynamics, demonstrating reduced bias and improved efficiency through theoretical proofs and empirical tests.
Contribution
It introduces Random Reshuffling into Langevin sampling, providing theoretical and empirical evidence of its bias reduction and efficiency benefits over traditional strategies.
Findings
Reduced bias in Wasserstein metric for strongly convex potentials
Analytical demonstration of bias reduction in Gaussian models
Empirical bias reduction in logistic regression experiments
Abstract
We examine the use of different randomisation policies for stochastic gradient algorithms used in sampling, based on first-order (or overdamped) Langevin dynamics, the most popular of which is known as Stochastic Gradient Langevin Dynamics. Conventionally, this algorithm is combined with a specific stochastic gradient strategy, called Robbins-Monro. In this work, we study an alternative strategy, Random Reshuffling, and show convincingly that it leads to improved performance via: a) a proof of reduced bias in the Wasserstein metric for strongly convex, gradient Lipschitz potentials; b) an analytical demonstration of reduced bias for a Gaussian model problem; and c) an empirical demonstration of reduced bias in numerical experiments for some logistic regression problems. This is especially important since Random Reshuffling is typically more efficient due to memory access and cache…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Theoretical and Computational Physics
MethodsLogistic Regression
