Gradient Norm Minimization of Nesterov Acceleration: $o(1/k^3)$
Shuo Chen, Bin Shi, Ya-xiang Yuan

TL;DR
This paper investigates the acceleration mechanism of Nesterov's gradient descent, introduces a new high-resolution differential equation framework, and proves that gradient norm minimization can achieve a rate faster than $o(1/k^3)$.
Contribution
It provides a simplified proof of NAG's acceleration, introduces an implicit-velocity high-resolution framework, and establishes a faster convergence rate for gradient norm minimization.
Findings
Proves gradient norm minimization of NAG can achieve $o(1/k^3)$ rate.
Introduces a new implicit-velocity high-resolution differential equation framework.
Shows a faster $o(1/k^2)$ rate for objective value minimization when $r > 2$.
Abstract
In the history of first-order algorithms, Nesterov's accelerated gradient descent (NAG) is one of the milestones. However, the cause of the acceleration has been a mystery for a long time. It has not been revealed with the existence of gradient correction until the high-resolution differential equation framework proposed in [Shi et al., 2021]. In this paper, we continue to investigate the acceleration phenomenon. First, we provide a significantly simplified proof based on precise observation and a tighter inequality for -smooth functions. Then, a new implicit-velocity high-resolution differential equation framework, as well as the corresponding implicit-velocity version of phase-space representation and Lyapunov function, is proposed to investigate the convergence behavior of the iterative sequence of NAG. Furthermore, from two kinds of phase-space…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Medical Imaging Techniques and Applications · Sparse and Compressive Sensing Techniques
