On the complexity of finding stationary points of smooth functions in one dimension
Sinho Chewi, S\'ebastien Bubeck, Adil Salim

TL;DR
This paper analyzes the query complexity of finding stationary points in one-dimensional smooth functions, revealing the benefits of randomness and zeroth-order information, and establishing the optimality of gradient descent among certain algorithms.
Contribution
It characterizes the query complexity in various settings and proves the optimality of gradient descent among deterministic first-order methods.
Findings
Randomness or zeroth-order info improves algorithm performance.
Gradient descent is optimal among deterministic first-order algorithms.
The study covers all dimensions d ≥ 1.
Abstract
We characterize the query complexity of finding stationary points of one-dimensional non-convex but smooth functions. We consider four settings, based on whether the algorithms under consideration are deterministic or randomized, and whether the oracle outputs -order or both - and -order information. Our results show that algorithms for this task provably benefit by incorporating either randomness or -order information. Our results also show that, for every dimension , gradient descent is optimal among deterministic algorithms using -order queries only.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Complexity and Algorithms in Graphs · Stochastic Gradient Optimization Techniques
