Optimal local linear convergence of Nesterov's accelerated gradient method for $C^2$ functions under the Polyak--{\L}ojasiewicz inequality

Zixu Feng; Hao Yuan

arXiv:2603.21516·math.OC·March 24, 2026

Optimal local linear convergence of Nesterov's accelerated gradient method for $C^2$ functions under the Polyak--{\L}ojasiewicz inequality

Zixu Feng, Hao Yuan

PDF

Open Access

TL;DR

This paper proves that Nesterov's accelerated gradient method achieves the optimal local linear convergence rate for $C^2$ functions satisfying the Polyak--{\

Contribution

It introduces a two-stage analysis that establishes the optimal local linear convergence rate under minimal smoothness assumptions.

Findings

01

Nesterov's method attains the optimal local linear convergence rate.

02

The analysis requires only $C^2$ smoothness, not higher.

03

Numerical experiments support the theoretical results.

Abstract

In this work, we establish that Nesterov's accelerated gradient method, applied to $C^{2}$ functions satisfying the Polyak--{\L}ojasiewicz inequality around local minimizers, achieves the optimal local linear convergence rate $ρ = \frac{3 L + μ - 2 μ}{3 L + μ} + ε$ , where $ε$ is an arbitrarily small constant. Our analysis requires neither higher-order smoothness beyond $C^{2}$ of the objective function nor any additional geometric regularity of the submanifold of local minimizers. The key novelty lies in a two-stage argument: we first establish a coarse yet valid local linear convergence rate and then, building upon this a priori convergence guarantee, obtain a refined characterization of the linearized iteration operator, which yields the optimal rate. As a result, we only need to slightly strengthen the standard $C^{1, 1}$ assumption, which is commonly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Optimization and Variational Analysis · Numerical methods in inverse problems