Gradient Descent Is Optimal Under Lower Restricted Secant Inequality And Upper Error Bound
Charles Guille-Escuret, Adam Ibrahim, Baptiste Goujaud, Ioannis, Mitliagkas

TL;DR
This paper demonstrates that gradient descent is exactly optimal for a class of functions characterized by a lower restricted secant inequality and an upper error bound, which includes some non-convex functions and avoids pathological condition number issues.
Contribution
It introduces a new function class with geometrical properties where gradient descent is proven to be optimal among all first-order methods.
Findings
Gradient descent achieves the lower bound on convergence rate for this class.
The class includes some non-convex functions, broadening the scope of analysis.
Analytical solutions to performance estimation problems are derived for this class.
Abstract
The study of first-order optimization is sensitive to the assumptions made on the objective functions. These assumptions induce complexity classes which play a key role in worst-case analysis, including the fundamental concept of algorithm optimality. Recent work argues that strong convexity and smoothness, popular assumptions in literature, lead to a pathological definition of the condition number (Guille-Escuret et al., 2021). Motivated by this result, we focus on the class of functions satisfying a lower restricted secant inequality and an upper error bound. On top of being robust to the aforementioned pathological behavior and including some non-convex functions, this pair of conditions displays interesting geometrical properties. In particular, the necessary and sufficient conditions to interpolate a set of points and their gradients within the class can be separated into simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Stochastic Gradient Optimization Techniques
