On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case

Ngoc Huy Chau, Éric Moulines, Miklos Rásonyi, Sotirios Sabanis, and Ying Zhang

arXiv:1905.13142·math.ST·February 3, 2021·SIAM J. Math. Data Sci.·26 cites

Open Access

TL;DR

This paper provides a non-asymptotic convergence analysis of Stochastic Gradient Langevin Dynamics (SGLD) algorithms for sampling from target distributions that are not necessarily log-concave. The analysis allows gradients to be estimated from dependent data streams and yields sharper bounds than previous studies.

Contribution

It offers sharper convergence estimates for SGLD in non-convex settings that are uniform in the number of iterations, and it allows the gradient estimates to be computed from dependent data streams.

Findings

01

Non-asymptotic $L^1$-Wasserstein convergence bounds established.

02

Analysis accommodates dependent data streams in gradient estimation.

03

Convergence estimates are sharper than those in previous studies and uniform in the number of iterations.

Abstract

We consider the problem of sampling from a target distribution, which is not necessarily log-concave, in the context of empirical risk minimization and stochastic optimization as presented in Raginsky et al. (2017). Non-asymptotic analysis results are established in the $L^1$-Wasserstein distance for the behaviour of Stochastic Gradient Langevin Dynamics (SGLD) algorithms. We allow the estimation of gradients to be performed even in the presence of dependent data streams. Our convergence estimates are sharper and uniform in the number of iterations, in contrast to those in previous studies.
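
To make the setting concrete, here is a minimal sketch of an SGLD recursion of the standard form $\theta_{k+1} = \theta_k - \lambda H(\theta_k, X_{k+1}) + \sqrt{2\lambda/\beta}\,\xi_{k+1}$, driven by a dependent data stream. Everything specific in it is an assumption made for illustration and is not taken from the paper: the one-dimensional non-convex potential $U(\theta) = \theta^2/2 + 2\cos(2\theta)$, the AR(1) process used as the data stream, the additive form of the stochastic gradient, and the names `grad_potential` and `sgld_dependent`.

```python
import numpy as np


def grad_potential(theta):
    # Gradient of the illustrative potential U(theta) = theta**2 / 2 + 2 * cos(2 * theta).
    # U is non-convex (its second derivative 1 - 8 * cos(2 * theta) changes sign).
    return theta - 4.0 * np.sin(2.0 * theta)


def sgld_dependent(n_steps, step=0.01, beta=1.0, rho=0.5, seed=0):
    """Run SGLD with a stochastic gradient perturbed by AR(1) (dependent) noise.

    Hypothetical setup: H(theta, x) = U'(theta) + x, where x follows a
    zero-mean AR(1) process with autocorrelation rho.
    """
    rng = np.random.default_rng(seed)
    theta = 0.0  # initial iterate
    x = 0.0      # initial state of the data stream
    path = np.empty(n_steps)
    for k in range(n_steps):
        # Dependent data: the next observation depends on the previous one.
        x = rho * x + np.sqrt(1.0 - rho**2) * rng.standard_normal()
        # Stochastic gradient estimate; its mean is U'(theta) because x has mean zero.
        h = grad_potential(theta) + x
        # SGLD update: gradient step plus injected Gaussian noise at inverse temperature beta.
        theta = theta - step * h + np.sqrt(2.0 * step / beta) * rng.standard_normal()
        path[k] = theta
    return path
```

Calling `sgld_dependent(10_000)` returns the trajectory of iterates as a NumPy array. Under these assumptions, a histogram of the later iterates can be compared informally with the density proportional to $\exp(-\beta U(\theta))$, but the sketch makes no claim about the constants or rates established in the paper.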

Taxonomy

Topics: Markov Chains and Monte Carlo Methods · Advanced Neuroimaging Techniques and Applications · Statistical Methods and Inference