Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Non-Convex Stochastic Optimization: Non-Asymptotic Performance Bounds and Momentum-Based Acceleration
Xuefeng Gao, Mert G\"urb\"uzbalaban, Lingjiong Zhu

TL;DR
This paper provides finite-time convergence guarantees for stochastic gradient Hamiltonian Monte Carlo (SGHMC), demonstrating its potential for accelerated non-convex optimization compared to SGLD through explicit complexity bounds.
Contribution
The work offers the first non-asymptotic performance bounds for SGHMC variants in non-convex stochastic optimization, including explicit constants and complexity analysis.
Findings
Finite-time convergence bounds for SGHMC variants.
Tighter complexity bounds for SGHMC than SGLD on certain problems.
Evidence of momentum-based acceleration in global non-convex optimization.
Abstract
Stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stochastic gradient with momentum where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates towards a global minimum. Many works reported its empirical success in practice for solving stochastic non-convex optimization problems, in particular it has been observed to outperform overdamped Langevin Monte Carlo-based methods such as stochastic gradient Langevin dynamics (SGLD) in many applications. Although asymptotic global convergence properties of SGHMC are well known, its finite-time performance is not well-understood. In this work, we study two variants of SGHMC based on two alternative discretizations of the underdamped Langevin diffusion. We provide finite-time performance bounds for the global convergence of both SGHMC variants for solving stochastic non-convex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference
