Higher Order Generalization Error for First Order Discretization of Langevin Diffusion
Mufan Bill Li, Maxime Gazeau

TL;DR
This paper analyzes the generalization error of first order discretizations of Langevin diffusion, showing under smoothness assumptions that they can achieve arbitrarily fast convergence rates in terms of iterations.
Contribution
It introduces smoothness conditions under which first order Langevin discretizations attain faster generalization error bounds than previously known.
Findings
First order methods can achieve arbitrarily fast convergence with smoothness assumptions.
The required number of iterations scales as psilon^{-1/N} for any N>0.
Smoothness assumptions enable improved generalization error bounds.
Abstract
We propose a novel approach to analyze generalization error for discretizations of Langevin diffusion, such as the stochastic gradient Langevin dynamics (SGLD). For an tolerance of expected generalization error, it is known that a first order discretization can reach this target if we run iterations with samples. In this article, we show that with additional smoothness assumptions, even first order methods can achieve arbitrarily runtime complexity. More precisely, for each , we provide a sufficient smoothness condition on the loss function such that a first order discretization can reach expected generalization error given iterations with samples.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Mathematical Biology Tumor Growth · Advanced Mathematical Modeling in Engineering
