Gradient Methods with Memory

Yurii Nesterov; Mihai I. Florea

arXiv:2105.09241·math.OC·June 2, 2021·Optim. Methods Softw.

Gradient Methods with Memory

Yurii Nesterov, Mihai I. Florea

PDF

TL;DR

This paper introduces gradient methods with memory that leverage previous iteration information to accelerate convergence in smooth convex minimization, combining improved theoretical guarantees with efficient computations.

Contribution

The paper proposes a novel class of gradient methods with memory for convex functions, achieving faster convergence and computational efficiency compared to standard gradient methods.

Findings

01

Methods outperform traditional gradient methods in oracle calls

02

New algorithms demonstrate improved convergence rates

03

Preliminary experiments confirm theoretical advantages

Abstract

In this paper, we consider gradient methods for minimizing smooth convex functions, which employ the information obtained at the previous iterations in order to accelerate the convergence towards the optimal solution. This information is used in the form of a piece-wise linear model of the objective function, which provides us with much better prediction abilities as compared with the standard linear model. To the best of our knowledge, this approach was never really applied in Convex Minimization to differentiable functions in view of the high complexity of the corresponding auxiliary problems. However, we show that all necessary computations can be done very efficiently. Consequently, we get new optimization methods, which are better than the usual Gradient Methods both in the number of oracle calls and in the computational time. Our theoretical conclusions are confirmed by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.