Functional Central Limit Theorem and Strong Law of Large Numbers for   Stochastic Gradient Langevin Dynamics

Attila Lovas; Mikl\'os R\'asonyi

arXiv:2210.02092·math.PR·August 1, 2023

Functional Central Limit Theorem and Strong Law of Large Numbers for Stochastic Gradient Langevin Dynamics

Attila Lovas, Mikl\'os R\'asonyi

PDF

Open Access

TL;DR

This paper establishes a strong law of large numbers and a functional central limit theorem for stochastic gradient Langevin dynamics (SGLD) with fixed step size, even when data are dependent, advancing theoretical understanding of SGLD's long-term behavior.

Contribution

It provides the first rigorous analysis of SGLD's asymptotic properties under dependent data streams, modeling it as a Markov chain in a random environment.

Findings

01

Proves a strong law of large numbers for SGLD.

02

Establishes a functional central limit theorem for SGLD.

03

Handles dependent data streams in the analysis.

Abstract

We study the mixing properties of an important optimization algorithm of machine learning: the stochastic gradient Langevin dynamics (SGLD) with a fixed step size. The data stream is not assumed to be independent hence the SGLD is not a Markov chain, merely a \emph{Markov chain in a random environment}, which complicates the mathematical treatment considerably. We derive a strong law of large numbers and a functional central limit theorem for SGLD.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Stochastic Gradient Optimization Techniques