Universal Average-Case Optimality of Polyak Momentum
Damien Scieur, Fabian Pedregosa

TL;DR
This paper demonstrates that Polyak momentum (PM) is asymptotically optimal in both worst-case and average-case scenarios for quadratic optimization, providing a new understanding of its empirical success.
Contribution
The paper proves that any optimal average-case method converges similarly to PM, establishing PM's universal average-case optimality under mild assumptions.
Findings
PM is asymptotically both worst-case and average-case optimal.
Any optimal average-case method converges to PM.
Provides a new theoretical foundation for PM's empirical effectiveness.
Abstract
Polyak momentum (PM), also known as the heavy-ball method, is a widely used optimization method that enjoys an asymptotic optimal worst-case complexity on quadratic objectives. However, its remarkable empirical success is not fully explained by this optimality, as the worst-case analysis -- contrary to the average-case -- is not representative of the expected complexity of an algorithm. In this work we establish a novel link between PM and the average-case analysis. Our main contribution is to prove that any optimal average-case method converges in the number of iterations to PM, under mild assumptions. This brings a new perspective on this classical method, showing that PM is asymptotically both worst-case and average-case optimal.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
