A Universal Catalyst for First-Order Optimization
Hongzhou Lin, Julien Mairal, Zaid Harchaoui

TL;DR
This paper presents a universal acceleration scheme for a broad class of first-order optimization algorithms, improving convergence rates and practical performance, especially on ill-conditioned problems.
Contribution
It introduces a generic acceleration framework applicable to many algorithms, with theoretical guarantees and practical benefits for non-strongly convex and ill-conditioned problems.
Findings
The scheme accelerates various first-order methods.
Theoretical speed-up is established for non-strongly convex objectives.
Practical improvements are demonstrated on ill-conditioned problems.
Abstract
We introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of the accelerated proximal point algorithm. Our approach consists of minimizing a convex objective by approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. This strategy applies to a large class of algorithms, including gradient descent, block coordinate descent, SAG, SAGA, SDCA, SVRG, Finito/MISO, and their proximal variants. For all of these methods, we provide acceleration and explicit support for non-strongly convex objectives. In addition to theoretical speed-up, we also show that acceleration is useful in practice, especially for ill-conditioned problems where we measure significant improvements.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Complexity and Algorithms in Graphs
