A Universal Catalyst for First-Order Optimization

Hongzhou Lin; Julien Mairal; Zaid Harchaoui

arXiv:1506.02186·math.OC·October 27, 2015·NeurIPS·186 cites

A Universal Catalyst for First-Order Optimization

Hongzhou Lin, Julien Mairal, Zaid Harchaoui

PDF

Open Access

TL;DR

This paper presents a universal acceleration scheme for a broad class of first-order optimization algorithms, improving convergence rates and practical performance, especially on ill-conditioned problems.

Contribution

It introduces a generic acceleration framework applicable to many algorithms, with theoretical guarantees and practical benefits for non-strongly convex and ill-conditioned problems.

Findings

01

The scheme accelerates various first-order methods.

02

Theoretical speed-up is established for non-strongly convex objectives.

03

Practical improvements are demonstrated on ill-conditioned problems.

Abstract

We introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of the accelerated proximal point algorithm. Our approach consists of minimizing a convex objective by approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. This strategy applies to a large class of algorithms, including gradient descent, block coordinate descent, SAG, SAGA, SDCA, SVRG, Finito/MISO, and their proximal variants. For all of these methods, we provide acceleration and explicit support for non-strongly convex objectives. In addition to theoretical speed-up, we also show that acceleration is useful in practice, especially for ill-conditioned problems where we measure significant improvements.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Complexity and Algorithms in Graphs