Langevin Dynamics: A Unified Perspective on Optimization via Lyapunov Potentials
August Y. Chen, Ayush Sekhari, Karthik Sridharan

TL;DR
This paper introduces a new Lyapunov potential-based framework for analyzing the convergence of Stochastic Gradient Langevin Dynamics (SGLD) in non-convex optimization, providing improved theoretical guarantees and finite gradient complexity bounds.
Contribution
It presents a novel Lyapunov potential approach for analyzing SGLD, extending convergence guarantees to stochastic gradient settings with finite gradient complexity.
Findings
Improved convergence rates for SGLD in non-convex optimization.
First finite gradient complexity guarantee for Lipschitz functions with Poincaré inequality.
Discrete-time SGLD inherits success guarantees from continuous Langevin Dynamics.
Abstract
We study the problem of non-convex optimization using Stochastic Gradient Langevin Dynamics (SGLD). SGLD is a natural and popular variation of stochastic gradient descent where at each step, appropriately scaled Gaussian noise is added. To our knowledge, the only strategy for showing global convergence of SGLD on the loss function is to show that SGLD can sample from a stationary distribution which assigns larger mass when the function is small (the Gibbs measure), and then to convert these guarantees to optimization results. We employ a new strategy to analyze the convergence of SGLD to global minima, based on Lyapunov potentials and optimization. We convert the same mild conditions from previous works on SGLD into geometric properties based on Lyapunov potentials. This adapts well to the case with a stochastic gradient oracle, which is natural for machine learning applications where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Thermodynamics and Statistical Mechanics · stochastic dynamics and bifurcation · Molecular Communication and Nanonetworks
