On the complexity of finding stationary points of smooth functions in one dimension

Sinho Chewi; Sébastien Bubeck; Adil Salim

arXiv:2209.07513 · math.OC · March 21, 2023 · ALT

Open Access

TL;DR

This paper characterizes the query complexity of finding stationary points of one-dimensional smooth non-convex functions. It shows that algorithms provably benefit from either randomness or zeroth-order information, and that gradient descent is optimal among deterministic algorithms using first-order queries only.

Contribution

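The paper characterizes the query complexity in four settings (deterministic or randomized algorithms, with an oracle returning first-order or both zeroth- and first-order information) and proves that gradient descent is optimal among deterministic algorithms using first-order queries only.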

Findings

01

Algorithms for this task provably benefit from incorporating either randomness or zeroth-order information.

02

Gradient descent is optimal among deterministic first-order algorithms.

03

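The optimality of gradient descent among deterministic first-order algorithms holds for every dimension d ≥ 1; the query complexity characterization itself is for one-dimensional functions.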

Abstract

We characterize the query complexity of finding stationary points of one-dimensional non-convex but smooth functions. We consider four settings, based on whether the algorithms under consideration are deterministic or randomized, and whether the oracle outputs $1^{\text{st}}$-order or both $0^{\text{th}}$- and $1^{\text{st}}$-order information. Our results show that algorithms for this task provably benefit by incorporating either randomness or $0^{\text{th}}$-order information. Our results also show that, for every dimension $d \geq 1$, gradient descent is optimal among deterministic algorithms using $1^{\text{st}}$-order queries only.
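
As a rough illustration of the deterministic first-order setting mentioned in the abstract, the sketch below runs plain gradient descent on a one-dimensional function and counts gradient queries until it reaches an approximate stationary point. It is not taken from the paper: the function name `gradient_descent_1d`, the step size $1/L$, the stopping rule $|f'(x)| \le \varepsilon$, and the test function $f(x) = 1 - \cos x$ are all assumptions made for illustration. The only bound it relies on is the standard descent-lemma estimate that, for an $L$-smooth $f$ with $f(x_0) - \inf f \le \Delta$, at most $2L\Delta/\varepsilon^2$ steps can have $|f'(x)| > \varepsilon$.

```python
import math


def gradient_descent_1d(grad, x0, smoothness, eps, max_queries=1_000_000):
    """Iterate x <- x - grad(x) / L until |grad(x)| <= eps.

    Returns the final point and the number of gradient (first-order) queries made.
    All names and defaults here are illustrative assumptions.
    """
    x = x0
    for queries in range(1, max_queries + 1):
        g = grad(x)  # one first-order oracle query
        if abs(g) <= eps:
            return x, queries
        x -= g / smoothness  # step size 1/L
    raise RuntimeError("query budget exhausted before reaching an eps-stationary point")


# Hypothetical test function: f(x) = 1 - cos(x), so f'(x) = sin(x) and L = 1.
L_SMOOTH = 1.0
EPS = 1e-3
DELTA = 2.0  # upper bound on f(x0) - inf f, since 0 <= 1 - cos(x) <= 2

x_final, n_queries = gradient_descent_1d(math.sin, x0=2.0, smoothness=L_SMOOTH, eps=EPS)
worst_case = 2 * L_SMOOTH * DELTA / EPS**2  # descent-lemma bound on the step count

print(f"stationary point ~ {x_final:.3e}, gradient queries = {n_queries}")
print(f"worst-case bound on steps = {worst_case:.0f}")
```

On a benign function like this one the query count is far below the worst-case bound; the bound is only attained on adversarially constructed instances.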

Taxonomy

Topics: Machine Learning and Algorithms · Complexity and Algorithms in Graphs · Stochastic Gradient Optimization Techniques