Accelerated Methods for Non-Convex Optimization
Yair Carmon, John C. Duchi, Oliver Hinder, Aaron Sidford

TL;DR
This paper introduces a Hessian-free accelerated gradient method for non-convex optimization that achieves faster convergence to stationary points with second-order guarantees, suitable for large-scale problems.
Contribution
It proposes a novel accelerated method with improved complexity for non-convex optimization, requiring only gradient evaluations and providing second-order guarantees.
Findings
Achieves $O( ext{epsilon}^{-7/4} ext{log}(1/ ext{epsilon}))$ complexity
Provides second-order guarantees with Hessian-free approach
Suitable for large-scale non-convex problems
Abstract
We present an accelerated gradient method for non-convex optimization problems with Lipschitz continuous first and second derivatives. The method requires time to find an -stationary point, meaning a point such that . The method improves upon the complexity of gradient descent and provides the additional second-order guarantee that for the computed . Furthermore, our method is Hessian free, i.e. it only requires gradient computations, and is therefore suitable for large scale applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
