Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for   Non-Convex Stochastic Optimization: Non-Asymptotic Performance Bounds and   Momentum-Based Acceleration

Xuefeng Gao; Mert G\"urb\"uzbalaban; Lingjiong Zhu

arXiv:1809.04618·math.OC·November 19, 2020·31 cites

Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Non-Convex Stochastic Optimization: Non-Asymptotic Performance Bounds and Momentum-Based Acceleration

Xuefeng Gao, Mert G\"urb\"uzbalaban, Lingjiong Zhu

PDF

Open Access

TL;DR

This paper provides finite-time convergence guarantees for stochastic gradient Hamiltonian Monte Carlo (SGHMC), demonstrating its potential for accelerated non-convex optimization compared to SGLD through explicit complexity bounds.

Contribution

The work offers the first non-asymptotic performance bounds for SGHMC variants in non-convex stochastic optimization, including explicit constants and complexity analysis.

Findings

01

Finite-time convergence bounds for SGHMC variants.

02

Tighter complexity bounds for SGHMC than SGLD on certain problems.

03

Evidence of momentum-based acceleration in global non-convex optimization.

Abstract

Stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stochastic gradient with momentum where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates towards a global minimum. Many works reported its empirical success in practice for solving stochastic non-convex optimization problems, in particular it has been observed to outperform overdamped Langevin Monte Carlo-based methods such as stochastic gradient Langevin dynamics (SGLD) in many applications. Although asymptotic global convergence properties of SGHMC are well known, its finite-time performance is not well-understood. In this work, we study two variants of SGHMC based on two alternative discretizations of the underdamped Langevin diffusion. We provide finite-time performance bounds for the global convergence of both SGHMC variants for solving stochastic non-convex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference