On The Convergence of Euler Discretization of Finite-Time Convergent Gradient Flows
Siqi Zhang, Mouhacine Benosman, Orlando Romero

TL;DR
This paper introduces and analyzes two novel first-order optimization algorithms, RGF and SGF, derived from discretizing finite-time convergent flows, demonstrating their effectiveness in faster convergence for various problems including neural network training.
Contribution
The paper proposes two new optimization algorithms based on discretized finite-time convergent flows, with convergence guarantees and enhanced performance in practice.
Findings
RGF and SGF algorithms converge faster than standard methods.
The algorithms perform well in both deterministic and stochastic settings.
Application to neural networks shows improved training speed.
Abstract
In this study, we investigate the performance of two novel first-order optimization algorithms, namely the rescaled-gradient flow (RGF) and the signed-gradient flow (SGF). These algorithms are derived from the forward Euler discretization of finite-time convergent flows, comprised of non-Lipschitz dynamical systems, which locally converge to the minima of gradient-dominated functions. We first characterize the closeness between the continuous flows and the discretizations, then we proceed to present (linear) convergence guarantees of the discrete algorithms (in the general and the stochastic case). Furthermore, in cases where problem parameters remain unknown or exhibit non-uniformity, we further integrate the line-search strategy with RGF/SGF and provide convergence analysis in this setting. We then apply the proposed algorithms to academic examples and deep neural network training,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Sparse and Compressive Sensing Techniques
