Convergence of First-Order Algorithms with Momentum from the Perspective of an Inexact Gradient Descent Method
Pham Duy Khanh, Boris Mordukhovich, Dat Ba Tran

TL;DR
This paper presents a unified inexact gradient descent framework with momentum, analyzing its convergence properties and demonstrating its effectiveness through numerical experiments on derivative-free optimization problems.
Contribution
It introduces a general inexact gradient descent method with momentum that unifies various algorithms and provides comprehensive convergence analysis under different function conditions.
Findings
Global convergence of EGm and SAMm for smooth functions
Local convergence for locally smooth and prox-regular functions
Numerical experiments confirm efficiency of momentum in inexact gradient methods
Abstract
This paper introduces a novel inexact gradient descent method with momentum (IGDm) considered as a general framework for various first-order methods with momentum. This includes, in particular, the inexact proximal point method (IPPm), extragradient method (EGm), and sharpness-aware minimization (SAMm). Asymptotic convergence properies of IGDm are established under both global and local assumptions on objective functions with providing constructive convergence rates depending on the Polyak-\L ojasiewicz-Kurdyka (PLK) conditions for the objective function. Global convergence of EGm and SAMm for general smooth functions and of IPPM for weakly convex functions is derived in this way. Moreover, local convergence properties of EGm and SAMm for locally smooth functions as well as of IPPm for prox-regular functions are established. Numerical experiments for derivative-free optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical methods in inverse problems · Iterative Methods for Nonlinear Equations · Radiative Heat Transfer Studies
