A unified high-resolution ODE framework for first-order methods
Lixia Wang, Hao Luo

TL;DR
This paper introduces a high-resolution ODE framework for analyzing accelerated first-order methods with momentum, providing new insights into their convergence and proposing modifications for improved performance.
Contribution
It develops a novel $O((\sqrt{s})^r)$-resolution ODE framework that accommodates momentum and variable parameters, extending previous models that required fixed point assumptions.
Findings
Deepens understanding of convergence properties of accelerated methods.
Identifies differences in ODEs for HB and NAG at $O(\sqrt{s})$-resolution.
Proposes modifications with proven convergence rates.
Abstract
For a generic discrete-time algorithm (DTA): , where is the step size, Lu (Math. Program., 194(1):1061--1112, 2022) proposed an -resolution ordinary differential equation (ODE) framework based on the backward error analysis, which can be used to analyze many DTAs satisfying the fixed point assumption such as gradient descent, extra gradient method and primal-dual hybrid gradient (PDHG). However, most first-order methods with momentum violate this critical assumption. To address this issue, in this work, we introduce a novel -resolution ODE framework for accelerated first-order methods allowing momentum and variable parameters, such as Nesterov accelerated gradient (NAG), heavy-ball (HB) method and accelerated mirror gradient. The proposed high-resolution framework provides deeper insight into the convergence properties of DTAs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical methods for differential equations · Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks
