Lower Bounds for Finding Stationary Points I
Yair Carmon, John C. Duchi, Oliver Hinder, Aaron Sidford

TL;DR
This paper establishes tight lower bounds on the number of queries needed by algorithms to find approximate stationary points in smooth, possibly non-convex functions, demonstrating the optimality of several common optimization methods.
Contribution
It provides the first sharp lower bounds on the complexity of finding stationary points in high-dimensional non-convex optimization, confirming the optimality of existing algorithms.
Findings
Lower bounds match the complexity of gradient descent and Newton's method.
Optimality of first- and higher-order regularization methods is established.
Results apply to a broad class of smooth, non-convex functions.
Abstract
We prove lower bounds on the complexity of finding -stationary points (points such that ) of smooth, high-dimensional, and potentially non-convex functions . We consider oracle-based complexity measures, where an algorithm is given access to the value and all derivatives of at a query point . We show that for any (potentially randomized) algorithm , there exists a function with Lipschitz th order derivatives such that requires at least queries to find an -stationary point. Our lower bounds are sharp to within constants, and they show that gradient descent, cubic-regularized Newton's method, and generalized th order regularization are worst-case optimal within their natural function classes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
