Provable non-accelerations of the heavy-ball method
Baptiste Goujaud, Adrien Taylor, Aymeric Dieuleveut

TL;DR
This paper proves that the heavy-ball method cannot achieve accelerated convergence rates on smooth strongly convex problems, showing limitations of this popular optimization technique.
Contribution
It establishes that the heavy-ball method provably does not accelerate convergence on smooth strongly convex functions, resolving an open question about its fundamental limitations.
Findings
Heavy-ball method does not reach accelerated rates on quadratic functions.
Existence of functions where heavy-ball fails to converge or cycles.
Results are robust to perturbations and higher-order regularity.
Abstract
In this work, we show that the heavy-ball () method provably does not reach an accelerated convergence rate on smooth strongly convex problems. More specifically, we show that for any condition number and any choice of algorithmic parameters, either the worst-case convergence rate of on the class of -smooth and -strongly convex \textit{quadratic} functions is not accelerated (that is, slower than ), or there exists an -smooth -strongly convex function and an initialization such that the method does not converge. To the best of our knowledge, this result closes a simple yet open question on one of the most used and iconic first-order optimization technique. Our approach builds on finding functions for which fails to converge and instead cycles over finitely many iterates. We analytically describe all parametrizations of …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research · Advanced Bandit Algorithms Research
