Adaptive Stochastic Gradient Langevin Dynamics: Taming Convergence and   Saddle Point Escape Time

Hejian Sang; Jia Liu

arXiv:1805.09416·cs.LG·May 25, 2018·1 cites

Adaptive Stochastic Gradient Langevin Dynamics: Taming Convergence and Saddle Point Escape Time

Hejian Sang, Jia Liu

PDF

Open Access

TL;DR

This paper introduces adaptive stochastic gradient Langevin dynamics algorithms that efficiently escape saddle points and converge to local minima in non-convex optimization, with iteration bounds nearly independent of problem dimension.

Contribution

It proposes a new adaptive Langevin dynamics framework and two specialized algorithms with improved convergence and saddle point escape guarantees.

Findings

01

Escape saddle points in O(log d) iterations

02

Converge to local minima in O(log d / ε^4) iterations

03

Outperforms existing first-order methods in convergence speed

Abstract

In this paper, we propose a new adaptive stochastic gradient Langevin dynamics (ASGLD) algorithmic framework and its two specialized versions, namely adaptive stochastic gradient (ASG) and adaptive gradient Langevin dynamics(AGLD), for non-convex optimization problems. All proposed algorithms can escape from saddle points with at most $O (lo g d)$ iterations, which is nearly dimension-free. Further, we show that ASGLD and ASG converge to a local minimum with at most $O (lo g d / ϵ^{4})$ iterations. Also, ASGLD with full gradients or ASGLD with a slowly linearly increasing batch size converge to a local minimum with iterations bounded by $O (lo g d / ϵ^{2})$ , which outperforms existing first-order methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods