Universal Average-Case Optimality of Polyak Momentum

Damien Scieur; Fabian Pedregosa

arXiv:2002.04664·math.OC·January 25, 2021·1 cites

Universal Average-Case Optimality of Polyak Momentum

Damien Scieur, Fabian Pedregosa

PDF

Open Access 4 Repos

TL;DR

This paper demonstrates that Polyak momentum (PM) is asymptotically optimal in both worst-case and average-case scenarios for quadratic optimization, providing a new understanding of its empirical success.

Contribution

The paper proves that any optimal average-case method converges similarly to PM, establishing PM's universal average-case optimality under mild assumptions.

Findings

01

PM is asymptotically both worst-case and average-case optimal.

02

Any optimal average-case method converges to PM.

03

Provides a new theoretical foundation for PM's empirical effectiveness.

Abstract

Polyak momentum (PM), also known as the heavy-ball method, is a widely used optimization method that enjoys an asymptotic optimal worst-case complexity on quadratic objectives. However, its remarkable empirical success is not fully explained by this optimality, as the worst-case analysis -- contrary to the average-case -- is not representative of the expected complexity of an algorithm. In this work we establish a novel link between PM and the average-case analysis. Our main contribution is to prove that any optimal average-case method converges in the number of iterations to PM, under mild assumptions. This brings a new perspective on this classical method, showing that PM is asymptotically both worst-case and average-case optimal.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research