Universal heavy-ball method for nonconvex optimization under H\"older continuous Hessians
Naoki Marumo, Akiko Takeda

TL;DR
This paper introduces a universal heavy-ball optimization method for nonconvex functions with H"older continuous Hessians, achieving optimal complexity without prior knowledge of problem-specific parameters.
Contribution
It develops a $ u$-independent heavy-ball method with restart mechanisms that adaptively attains optimal convergence rates for nonconvex optimization.
Findings
Achieves gradient norm less than $\
Demonstrates effectiveness through numerical experiments.
Does not require prior knowledge of H"older constants or Lipschitz parameters.
Abstract
We propose a new first-order method for minimizing nonconvex functions with Lipschitz continuous gradients and H\"older continuous Hessians. The proposed algorithm is a heavy-ball method equipped with two particular restart mechanisms. It finds a solution where the gradient norm is less than in function and gradient evaluations, where and are the H\"older exponent and constant, respectively. Our algorithm is -independent and thus universal; it automatically achieves the above complexity bound with the optimal without knowledge of . In addition, the algorithm does not require other problem-dependent parameters as input, including the gradient's Lipschitz constant or the target accuracy . Numerical results illustrate that the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Optimization and Variational Analysis
