Conditions for linear convergence of the gradient method for non-convex optimization

Hadi Abbaszadehpeivasti; Etienne de Klerk; Moslem Zamani

arXiv:2204.00647·math.OC·April 5, 2022·Optim. Lett.

Open Access

TL;DR

This paper establishes that the Polyak-Lojasiewicz inequality is both necessary and sufficient for the linear convergence of the gradient method with fixed step sizes in certain non-convex smooth optimization problems.

Contribution

The paper derives a new linear convergence rate for the gradient method with fixed step lengths under the PL inequality, and shows that the PL inequality is both necessary and sufficient for linear convergence to the optimal value.
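
For reference, the PL inequality can be written as below. This is a sketch in standard textbook notation; the symbols $\mu$, $L$, $f^\star$ and $x_k$ are assumptions here and may differ from the paper's own. The second display is the classical bound for step size $1/L$, not the new rate derived in the paper.

```latex
% PL inequality with constant \mu > 0, where f^\star is the optimal value:
\[
  f(x) - f^\star \;\le\; \frac{1}{2\mu}\,\|\nabla f(x)\|^2
  \quad \text{for all } x .
\]
% Classical consequence for an L-smooth f and the fixed-step iteration
% x_{k+1} = x_k - \tfrac{1}{L}\nabla f(x_k):
\[
  f(x_{k+1}) - f^\star \;\le\; \Bigl(1 - \frac{\mu}{L}\Bigr)\bigl(f(x_k) - f^\star\bigr).
\]
```

The second bound follows by combining the descent inequality $f(x_{k+1}) \le f(x_k) - \tfrac{1}{2L}\|\nabla f(x_k)\|^2$ with the PL inequality evaluated at $x_k$.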

Findings

01

PL inequality is necessary and sufficient for linear convergence

02

Identifies classes of functions with linear convergence

03

Explores relationships between these classes and PL inequality

Abstract

In this paper, we derive a new linear convergence rate for the gradient method with fixed step lengths for non-convex smooth optimization problems satisfying the Polyak-Lojasiewicz (PL) inequality. We establish that the PL inequality is a necessary and sufficient condition for linear convergence to the optimal value for this class of problems. We list some related classes of functions for which the gradient method may enjoy a linear convergence rate. Moreover, we investigate their relationship with the PL inequality.
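
The setting in the abstract can be illustrated with a minimal sketch: the gradient method with a fixed step on a non-convex function that satisfies the PL inequality. The test function f(x) = x^2 + 3 sin^2(x), the constants L = 8 and mu = 1/32, and all names below are assumptions chosen for illustration and are not taken from the paper. The check uses the classical contraction factor 1 - mu/L for step 1/L, not the paper's new rate.

```python
import math

# Illustrative non-convex test function (an assumption, not from the paper).
# f''(x) = 2 + 6*cos(2x) lies in [-4, 8], so f is non-convex and 8-smooth.
def f(x):
    return x * x + 3.0 * math.sin(x) ** 2

def grad(x):
    return 2.0 * x + 3.0 * math.sin(2.0 * x)

L = 8.0            # smoothness constant of f
MU = 1.0 / 32.0    # a conservative PL constant for f (assumed for this sketch)
F_STAR = 0.0       # optimal value, attained at x = 0

def gradient_method(x0, step, iters):
    """Run x_{k+1} = x_k - step * grad(x_k) and return the optimality gaps."""
    x = x0
    gaps = [f(x) - F_STAR]
    for _ in range(iters):
        x = x - step * grad(x)
        gaps.append(f(x) - F_STAR)
    return gaps

gaps = gradient_method(x0=3.0, step=1.0 / L, iters=50)

# Classical per-step bound: gap_{k+1} <= (1 - MU/L) * gap_k.
rate = 1.0 - MU / L
bound_holds = all(g1 <= rate * g0 + 1e-12 for g0, g1 in zip(gaps, gaps[1:]))
print("final gap:", gaps[-1], "| classical bound holds:", bound_holds)
```

On this example the observed decrease is much faster than the factor 1 - mu/L suggests, since mu = 1/32 is a loose constant for this function.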

Taxonomy

Topics: Optimization and Variational Analysis · Mathematical Inequalities and Applications · Advanced Optimization Algorithms Research