The Global R-linear Convergence of Nesterov's Accelerated Gradient Method with Unknown Strongly Convex Parameter
Chenglong Bao, Liang Chen, Jiahong Li

TL;DR
This paper proves that Nesterov's accelerated gradient method with an unknown strong convexity parameter achieves global R-linear convergence, extending to proximal methods and contradicting previous continuous-time analyses.
Contribution
It establishes the first proof of global R-linear convergence for NAG with unknown strong convexity, using Lyapunov sequences, and extends results to proximal gradient methods.
Findings
Proves Q-linear convergence of Lyapunov sequences for NAG with unknown db.
Extends convergence results to accelerated proximal gradient methods.
Contradicts previous continuous-time convergence rate limitations.
Abstract
The Nesterov accelerated gradient (NAG) method is an important extrapolation-based numerical algorithm that accelerates the convergence of the gradient descent method in convex optimization. When dealing with an objective function that is -strongly convex, selecting extrapolation coefficients dependent on enables global R-linear convergence. In cases where is unknown, a commonly adopted approach is to set the extrapolation coefficient using the original NAG method. This choice allows for achieving the optimal iteration complexity among first-order methods for general convex problems. However, it remains unknown whether the NAG method with an unknown strongly convex parameter exhibits global R-linear convergence for strongly convex problems. In this work, we answer this question positively by establishing the Q-linear convergence of certain constructed Lyapunov…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
