"Convex Until Proven Guilty": Dimension-Free Acceleration of Gradient Descent on Non-Convex Functions
Yair Carmon, Oliver Hinder, John C. Duchi, Aaron Sidford

TL;DR
This paper introduces a dimension-free accelerated gradient descent method for non-convex functions that either converges quickly or provides a certificate of non-convexity, enabling efficient optimization.
Contribution
It presents a novel AGD variant that detects non-convexity and exploits negative curvature for faster convergence on non-convex functions.
Findings
Achieves dimension-free acceleration for non-convex optimization.
Provides convergence guarantees with specific evaluation complexities.
Offers a non-convexity certificate to guide optimization strategies.
Abstract
We develop and analyze a variant of Nesterov's accelerated gradient descent (AGD) for minimization of smooth non-convex functions. We prove that one of two cases occurs: either our AGD variant converges quickly, as if the function was convex, or we produce a certificate that the function is "guilty" of being non-convex. This non-convexity certificate allows us to exploit negative curvature and obtain deterministic, dimension-free acceleration of convergence for non-convex functions. For a function with Lipschitz continuous gradient and Hessian, we compute a point with in gradient and function evaluations. Assuming additionally that the third derivative is Lipschitz, we require only evaluations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Numerical methods in inverse problems
