Higher Order Generalization Error for First Order Discretization of Langevin Diffusion

Mufan Bill Li; Maxime Gazeau

arXiv:2102.06229·stat.ML·February 15, 2021

TL;DR

This paper analyzes the generalization error of first order discretizations of Langevin diffusion, showing under smoothness assumptions that they can achieve arbitrarily fast convergence rates in terms of iterations.

Contribution

It introduces smoothness conditions on the loss function under which first order Langevin discretizations reach a target expected generalization error in fewer iterations than previously known.

Findings

01

First order methods can achieve arbitrarily fast convergence with smoothness assumptions.

02

The required number of iterations scales as $\epsilon^{-1/N}$, up to a logarithmic factor, for any $N > 0$, given a sufficient smoothness condition on the loss function.

03

Smoothness assumptions enable improved generalization error bounds.
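
To give a feel for how the exponent $1/N$ changes the iteration count, the sketch below evaluates $\epsilon^{-1/N} \log(\epsilon^{-1})$ for a few values of $N$. It is only an illustration: every constant hidden by the asymptotic notation is set to 1, and the function name `iteration_scaling` and its parameters are hypothetical, not taken from the paper.

```python
import math


def iteration_scaling(eps: float, n: float = 1.0) -> float:
    """Evaluate eps^(-1/n) * log(1/eps), with all hidden constants set to 1.

    This is an assumption-laden illustration of the scaling quoted in the
    findings, not an iteration count prescribed by the paper.
    """
    if not (0.0 < eps < 1.0):
        raise ValueError("eps must lie strictly between 0 and 1")
    if n <= 0.0:
        raise ValueError("n must be positive")
    return eps ** (-1.0 / n) * math.log(1.0 / eps)


# Compare the previously known first order scaling (n = 1) with
# higher values of n at a fixed tolerance.
eps = 1e-2
scalings = {n: iteration_scaling(eps, n) for n in (1, 2, 4)}
for n, value in scalings.items():
    print(f"N = {n}: about {value:.1f} (constants ignored)")
```

For a fixed tolerance, the value shrinks as $N$ grows, while the logarithmic factor is unaffected.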

Abstract

We propose a novel approach to analyze generalization error for discretizations of Langevin diffusion, such as the stochastic gradient Langevin dynamics (SGLD). For an $\epsilon$ tolerance of expected generalization error, it is known that a first order discretization can reach this target if we run $\Omega(\epsilon^{-1} \log(\epsilon^{-1}))$ iterations with $\Omega(\epsilon^{-1})$ samples. In this article, we show that with additional smoothness assumptions, even first order methods can achieve arbitrarily fast runtime complexity. More precisely, for each $N > 0$, we provide a sufficient smoothness condition on the loss function such that a first order discretization can reach $\epsilon$ expected generalization error given $\Omega(\epsilon^{-1/N} \log(\epsilon^{-1}))$ iterations with $\Omega(\epsilon^{-1})$ samples.
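
For readers unfamiliar with SGLD, the following is a minimal sketch of the standard update: a minibatch gradient step plus Gaussian noise scaled by the step size and an inverse temperature. It is a generic textbook form, not necessarily the exact scheme analyzed in the paper. The function name `sgld`, its parameters, and the quadratic toy loss are all assumptions made for illustration.

```python
import numpy as np


def sgld(grad_loss, data, w0, step_size, inv_temp, n_iters, batch_size, seed=0):
    """Run a generic SGLD loop (illustrative sketch, not the paper's code).

    Each iteration applies
        w <- w - step_size * g + sqrt(2 * step_size / inv_temp) * xi,
    where g is a minibatch gradient estimate and xi is standard Gaussian noise.
    """
    rng = np.random.default_rng(seed)
    w = np.array(w0, dtype=float)
    n = len(data)
    for _ in range(n_iters):
        # Sample a minibatch without replacement and average its gradients.
        idx = rng.choice(n, size=batch_size, replace=False)
        g = np.mean([grad_loss(w, data[i]) for i in idx], axis=0)
        # Inject isotropic Gaussian noise (the Langevin diffusion term).
        xi = rng.standard_normal(w.shape)
        w = w - step_size * g + np.sqrt(2.0 * step_size / inv_temp) * xi
    return w


# Toy example (an assumption for illustration): loss 0.5 * ||w - x||^2,
# whose gradient in w is simply w - x.
def quadratic_grad(w, x):
    return w - x


data = np.random.default_rng(1).normal(loc=1.0, scale=1.0, size=(200, 1))
w_hat = sgld(quadratic_grad, data, w0=np.zeros(1), step_size=0.1,
             inv_temp=1e4, n_iters=500, batch_size=32)
print("final iterate:", w_hat, "sample mean:", data.mean())
```

With a large inverse temperature the injected noise is small, so on this toy loss the final iterate should land near the sample mean. The update uses only gradients of the loss, which is what makes it a first order method.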

Taxonomy

Topics: Markov Chains and Monte Carlo Methods · Mathematical Biology Tumor Growth · Advanced Mathematical Modeling in Engineering