Universal heavy-ball method for nonconvex optimization under Hölder continuous Hessians

Naoki Marumo; Akiko Takeda

arXiv:2303.01073·math.OC·January 5, 2026·1 cites

TL;DR

This paper introduces a universal heavy-ball method for minimizing nonconvex functions with Hölder continuous Hessians. It attains its complexity bound for the best Hölder exponent without being given any problem-specific parameters.

Contribution

It develops a $ν$-independent heavy-ball method with two restart mechanisms that automatically attains its complexity bound for the best Hölder exponent $ν \in [0, 1]$.
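To make the idea of a heavy-ball iteration with a restart concrete, here is a minimal sketch. It is not the algorithm from the paper: the fixed `step` and `momentum` values, the function-value restart test, and the toy objective are all assumptions chosen for illustration, whereas the paper's method uses two particular restart mechanisms and needs no such parameters as input.

```python
import math


def heavy_ball_with_restart(f, grad, x0, step=0.01, momentum=0.9,
                            tol=1e-6, max_iter=10_000):
    """Classical heavy-ball iteration with a simple function-value restart.

    Illustrative only: step, momentum, and the restart test are hypothetical
    choices, not the parameter-free scheme described in the paper.
    """
    x = list(x0)
    v = [0.0] * len(x)  # velocity (momentum) term
    fx = f(x)
    for k in range(max_iter):
        g = grad(x)
        if math.sqrt(sum(gi * gi for gi in g)) < tol:
            return x, k  # gradient norm below tolerance
        # Heavy-ball update: new velocity mixes old velocity and gradient.
        v_new = [momentum * vi - step * gi for vi, gi in zip(v, g)]
        x_new = [xi + vi for xi, vi in zip(x, v_new)]
        f_new = f(x_new)
        if f_new > fx:
            # Restart: drop the accumulated momentum and take a plain
            # gradient step from the current point instead.
            v_new = [-step * gi for gi in g]
            x_new = [xi + vi for xi, vi in zip(x, v_new)]
            f_new = f(x_new)
        x, v, fx = x_new, v_new, f_new
    return x, max_iter


# Toy nonconvex objective (an assumption for this example): a double well
# in x[0] plus a quadratic in x[1]; stationary points at (+-1, 0) and (0, 0).
def f(x):
    return 0.25 * x[0] ** 4 - 0.5 * x[0] ** 2 + 0.5 * x[1] ** 2


def grad(x):
    return [x[0] ** 3 - x[0], x[1]]


x_star, iters = heavy_ball_with_restart(f, grad, [2.0, 1.0])
print(x_star, iters)
```

The restart here only guards against an increase in the objective. It shows why discarding momentum can stabilize the iteration, but it carries none of the complexity guarantees stated for the paper's method.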

Findings

01

Achieves gradient norm less than $ϵ$ in $O(H_{ν}^{\frac{1}{2 + 2ν}} ϵ^{-\frac{4 + 3ν}{2 + 2ν}})$ function and gradient evaluations.

02

Demonstrates effectiveness through numerical experiments.

03

Does not require prior knowledge of Hölder constants or Lipschitz parameters.

Abstract

We propose a new first-order method for minimizing nonconvex functions with Lipschitz continuous gradients and Hölder continuous Hessians. The proposed algorithm is a heavy-ball method equipped with two particular restart mechanisms. It finds a solution where the gradient norm is less than $ϵ$ in $O(H_{ν}^{\frac{1}{2 + 2ν}} ϵ^{-\frac{4 + 3ν}{2 + 2ν}})$ function and gradient evaluations, where $ν \in [0, 1]$ and $H_{ν}$ are the Hölder exponent and constant, respectively. Our algorithm is $ν$-independent and thus universal; it automatically achieves the above complexity bound with the optimal $ν \in [0, 1]$ without knowledge of $H_{ν}$. In addition, the algorithm does not require other problem-dependent parameters as input, including the gradient's Lipschitz constant or the target accuracy $ϵ$. Numerical results illustrate that the proposed…
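As a quick arithmetic check of the bound above, the following sketch evaluates the two exponents, $\frac{1}{2 + 2ν}$ on $H_{ν}$ and $-\frac{4 + 3ν}{2 + 2ν}$ on $ϵ$, at a few values of $ν$. The helper name `complexity_exponents` is made up for this example. Only the exponents are computed, since the constant hidden in the $O(\cdot)$ is not given here.

```python
from fractions import Fraction


def complexity_exponents(nu):
    """Return (exponent of H_nu, exponent of epsilon) in the stated bound.

    Hypothetical helper for illustration; nu is the Hoelder exponent in [0, 1].
    """
    nu = Fraction(nu)
    if not 0 <= nu <= 1:
        raise ValueError("nu must lie in [0, 1]")
    h_exp = Fraction(1) / (2 + 2 * nu)
    eps_exp = -(4 + 3 * nu) / (2 + 2 * nu)
    return h_exp, eps_exp


for nu in (0, Fraction(1, 2), 1):
    h_exp, eps_exp = complexity_exponents(nu)
    print(f"nu = {nu}: H^({h_exp}) * eps^({eps_exp})")
# nu = 0:   H^(1/2) * eps^(-2)
# nu = 1/2: H^(1/3) * eps^(-11/6)
# nu = 1:   H^(1/4) * eps^(-7/4)
```

The dependence on $ϵ$ improves from $ϵ^{-2}$ at $ν = 0$ to $ϵ^{-7/4}$ at $ν = 1$, so a larger Hölder exponent gives a better rate in $ϵ$.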

Code & Models

Repositories

n-marumo/restarted-hb (JAX, official)

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Optimization and Variational Analysis