Convergence of Langevin MCMC in KL-divergence
Xiang Cheng, Peter Bartlett

TL;DR
This paper analyzes the convergence rate of Langevin MCMC algorithms in KL-divergence, providing new bounds under smooth and strongly convex conditions, and extending to non-strongly convex cases with a gradient flow perspective.
Contribution
It establishes the convergence rate of Langevin diffusion in KL-divergence under smooth and strongly convex conditions, and offers a novel analysis using gradient flow methods.
Findings
Langevin diffusion achieves KL divergence $ o 0$ in $ ilde{O}(d/\epsilon)$ steps under strong convexity.
The analysis applies to convergence in KL-divergence, a stronger metric than total variation.
Provides a simpler proof of existing convergence results using gradient flow perspective.
Abstract
Langevin diffusion is a commonly used tool for sampling from a given distribution. In this work, we establish that when the target density is such that is smooth and strongly convex, discrete Langevin diffusion produces a distribution with in steps, where is the dimension of the sample space. We also study the convergence rate when the strong-convexity assumption is absent. By considering the Langevin diffusion as a gradient flow in the space of probability distributions, we obtain an elegant analysis that applies to the stronger property of convergence in KL-divergence and gives a conceptually simpler proof of the best-known convergence results in weaker metrics.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Stochastic processes and statistical mechanics
