Restarted Nonconvex Accelerated Gradient Descent: No More Polylogarithmic Factor in the $O(\epsilon^{-7/4})$ Complexity
Huan Li, Zhouchen Lin

TL;DR
This paper introduces simple restarted accelerated gradient methods for nonconvex optimization that achieve an improved $O(rac{1}{epsilon^{7/4}})$ complexity without polylogarithmic factors, using elementary proofs.
Contribution
The paper presents two new restarted accelerated gradient methods with improved complexity bounds and elementary analysis, avoiding complex techniques used in prior work.
Findings
Achieves $O(epsilon^{-7/4})$ complexity for first-order stationary points.
Complexity bound improves previous results by removing polylogarithmic factors.
Methods are simple, based on classical AGD and HB with restart mechanisms.
Abstract
This paper studies accelerated gradient methods for nonconvex optimization with Lipschitz continuous gradient and Hessian. We propose two simple accelerated gradient methods, restarted accelerated gradient descent (AGD) and restarted heavy ball (HB) method, and establish that our methods achieve an -approximate first-order stationary point within number of gradient evaluations by elementary proofs. Theoretically, our complexity does not hide any polylogarithmic factors, and thus it improves over the best known one by the factor. Our algorithms are simple in the sense that they only consist of Nesterov's classical AGD or Polyak's HB iterations, as well as a restart mechanism. They do not invoke negative curvature exploitation or minimization of regularized surrogate functions as the subroutines. In contrast with existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Neuroimaging Techniques and Applications
