Generalized Continuous-Time Models for Nesterov's Accelerated Gradient Methods
Chanwoong Park, Youngchae Cho, Insoon Yang

TL;DR
This paper introduces a unified continuous-time modeling framework for Nesterov's accelerated gradient methods, enabling comprehensive analysis, convergence rate determination, and the development of a generalized restart scheme with broad applicability.
Contribution
It presents a generalized continuous-time model that unifies existing models, determines convergence rates, and proposes a new restart scheme applicable to a wider class of Nesterov's methods.
Findings
Six existing models are special cases of the generalized model.
The generalized models achieve accelerated convergence rates via time reparametrization.
The proposed restart scheme ensures monotonic decrease in objective function values.
Abstract
Recent research has indicated a substantial rise in interest in understanding Nesterov's accelerated gradient methods via their continuous-time models. However, most existing studies focus on specific classes of Nesterov's methods, which hinders the attainment of an in-depth understanding and a unified perspective. To address this deficit, we present generalized continuous-time models that cover a broad range of Nesterov's methods, including those previously studied under existing continuous-time frameworks. Our key contributions are as follows. First, we identify the convergence rates of the generalized models, eliminating the need to determine the convergence rate for any specific continuous-time model derived from them. Second, we show that six existing continuous-time models are special cases of our generalized models, thereby positioning our framework as a unifying tool for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
MethodsFocus
