The Global R-linear Convergence of Nesterov's Accelerated Gradient Method with Unknown Strongly Convex Parameter

Chenglong Bao; Liang Chen; Jiahong Li

arXiv:2308.14080·math.OC·May 28, 2025

The Global R-linear Convergence of Nesterov's Accelerated Gradient Method with Unknown Strongly Convex Parameter

Chenglong Bao, Liang Chen, Jiahong Li

PDF

Open Access

TL;DR

This paper proves that Nesterov's accelerated gradient method with an unknown strong convexity parameter achieves global R-linear convergence, extending to proximal methods and contradicting previous continuous-time analyses.

Contribution

It establishes the first proof of global R-linear convergence for NAG with unknown strong convexity, using Lyapunov sequences, and extends results to proximal gradient methods.

Findings

01

Proves Q-linear convergence of Lyapunov sequences for NAG with unknown db.

02

Extends convergence results to accelerated proximal gradient methods.

03

Contradicts previous continuous-time convergence rate limitations.

Abstract

The Nesterov accelerated gradient (NAG) method is an important extrapolation-based numerical algorithm that accelerates the convergence of the gradient descent method in convex optimization. When dealing with an objective function that is $μ$ -strongly convex, selecting extrapolation coefficients dependent on $μ$ enables global R-linear convergence. In cases where $μ$ is unknown, a commonly adopted approach is to set the extrapolation coefficient using the original NAG method. This choice allows for achieving the optimal iteration complexity among first-order methods for general convex problems. However, it remains unknown whether the NAG method with an unknown strongly convex parameter exhibits global R-linear convergence for strongly convex problems. In this work, we answer this question positively by establishing the Q-linear convergence of certain constructed Lyapunov…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques