Heavy-ball Differential Equation Achieves $O(\varepsilon^{-7/4})$   Convergence for Nonconvex Functions

Kaito Okamura; Naoki Marumo; and Akiko Takeda

arXiv:2406.06100·math.OC·May 2, 2025

Heavy-ball Differential Equation Achieves $O(\varepsilon^{-7/4})$ Convergence for Nonconvex Functions

Kaito Okamura, Naoki Marumo, and Akiko Takeda

PDF

Open Access

TL;DR

This paper demonstrates that the heavy-ball differential equation, a continuous-time model of optimization algorithms, can reach an $ ext{ε}$-stationary point in $O( ext{ε}^{-7/4})$ time for nonconvex functions, simplifying convergence analysis.

Contribution

The paper establishes that the heavy-ball differential equation achieves the optimal $O( ext{ε}^{-7/4})$ convergence rate without additional mechanisms, providing a simpler approach for nonconvex optimization.

Findings

01

Heavy-ball differential equation attains $O(ε^{-7/4})$ convergence.

02

Simplifies understanding of heavy-ball method dynamics.

03

Provides a continuous-time perspective on optimization complexity.

Abstract

First-order optimization methods for nonconvex functions with Lipschitz continuous gradient and Hessian have been extensively studied. State-of-the-art methods for finding an $ε$ -stationary point within $O (ε^{- 7/4})$ or $\tilde{O} (ε^{- 7/4})$ gradient evaluations are based on Nesterov's accelerated gradient descent (AGD) or Polyak's heavy-ball (HB) method. However, these algorithms employ additional mechanisms, such as restart schemes and negative curvature exploitation, which complicate their behavior and make it challenging to apply them to more advanced settings (e.g., stochastic optimization). As a first step in investigating whether a simple algorithm with $O (ε^{- 7/4})$ complexity can be constructed without such additional mechanisms, we study the HB differential equation, a continuous-time analogue of the AGD and HB methods. We prove…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Variational Analysis · Advanced Optimization Algorithms Research