A Variational Perspective on Accelerated Methods in Optimization

Andre Wibisono; Ashia C. Wilson; Michael I. Jordan

arXiv:1603.04245·math.OC·June 8, 2022

A Variational Perspective on Accelerated Methods in Optimization

Andre Wibisono, Ashia C. Wilson, Michael I. Jordan

PDF

TL;DR

This paper introduces a unified continuous-time framework using the Bregman Lagrangian to understand and generate a broad class of accelerated optimization methods, revealing their common geometric structure.

Contribution

It presents the Bregman Lagrangian as a unifying continuous-time perspective for various accelerated methods, including their derivation and generalizations.

Findings

01

All accelerated methods follow the same continuous-time curve at different speeds.

02

Nesterov's acceleration can be viewed as discretizing a continuous-time trajectory.

03

The framework encompasses Euclidean and non-Euclidean accelerated methods.

Abstract

Accelerated gradient methods play a central role in optimization, achieving optimal rates in many settings. While many generalizations and extensions of Nesterov's original acceleration method have been proposed, it is not yet clear what is the natural scope of the acceleration concept. In this paper, we study accelerated methods from a continuous-time perspective. We show that there is a Lagrangian functional that we call the \emph{Bregman Lagrangian} which generates a large class of accelerated methods in continuous time, including (but not limited to) accelerated gradient descent, its non-Euclidean extension, and accelerated higher-order gradient methods. We show that the continuous-time limit of all of these methods correspond to traveling the same curve in spacetime at different speeds. From this perspective, Nesterov's technique and many of its generalizations can be viewed as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.