A Note on the Gradient-Evaluation Sequence in Accelerated Gradient Methods
Yan Wu, Yipeng Zhang, Lu Liu, Yuyuan Ouyang

TL;DR
This paper investigates the gradient-evaluation sequence in accelerated gradient methods, proving it can achieve optimal iteration complexity for convex smooth problems, including constrained and non-Euclidean cases.
Contribution
It establishes that the gradient-evaluation sequence in AGD attains the same optimal complexity as the approximate solution sequence, extending results to constrained and non-Euclidean settings.
Findings
Gradient-evaluation sequence achieves O(L/k^2) convergence.
Results hold for constrained and non-Euclidean settings.
Confirms optimal complexity for gradient-evaluation sequence in AGD.
Abstract
Nesterov's accelerated gradient descent method (AGD) is a seminal deterministic first-order method known to achieve the optimal order of iteration complexity for solving convex smooth optimization problems. Two distinct sequences of iterates are included in the description of AGD: gradient evaluations are performed at one sequence, while approximate solutions are selected from the other. The iteration complexity on minimizing objective function value has been well-studied in the literature, but such analysis is almost always performed only at the approximate solution sequence. To the best of our knowledge, for projection-based AGD that solves problems with feasible sets, it is still an open research question whether the gradient evaluation sequence (when treated as approximate solutions) could also achieve the same optimal order of iteration complexity. It is also unknown whether such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research · Optimization and Variational Analysis
