Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem

Justin Sirignano; Konstantinos Spiliopoulos

arXiv:1710.04273·math.PR·June 18, 2019

Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem

Justin Sirignano, Konstantinos Spiliopoulos

PDF

TL;DR

This paper establishes a central limit theorem for stochastic gradient descent in continuous time, providing insights into its convergence behavior for both convex and non-convex functions.

Contribution

It introduces a CLT for SGD in continuous time, extending understanding of its asymptotic distribution and convergence rates in complex settings.

Findings

01

Proves a CLT for strongly convex functions.

02

Establishes an $L^{p}$ convergence rate in the strongly convex case.

03

Extends analysis to certain non-convex functions.

Abstract

Stochastic gradient descent in continuous time (SGDCT) provides a computationally efficient method for the statistical learning of continuous-time models, which are widely used in science, engineering, and finance. The SGDCT algorithm follows a (noisy) descent direction along a continuous stream of data. The parameter updates occur in continuous time and satisfy a stochastic differential equation. This paper analyzes the asymptotic convergence rate of the SGDCT algorithm by proving a central limit theorem (CLT) for strongly convex objective functions and, under slightly stronger conditions, for non-convex objective functions as well. An $L^{p}$ convergence rate is also proven for the algorithm in the strongly convex case. The mathematical analysis lies at the intersection of stochastic analysis and statistical learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.