FLAG n' FLARE: Fast Linearly-Coupled Adaptive Gradient Methods

Xiang Cheng; Farbod Roosta-Khorasani; Stefan Palombo; Peter L.; Bartlett; Michael W. Mahoney

arXiv:1605.08108·math.OC·November 15, 2017

FLAG n' FLARE: Fast Linearly-Coupled Adaptive Gradient Methods

Xiang Cheng, Farbod Roosta-Khorasani, Stefan Palombo, Peter L., Bartlett, Michael W. Mahoney

PDF

Open Access

TL;DR

FLAG and FLARE are accelerated, adaptive gradient methods that optimize composite objectives efficiently by combining the best features of acceleration and adaptivity, suitable for various machine learning tasks.

Contribution

Introduction of FLAG and FLARE, novel gradient methods that achieve optimal convergence rates while adaptively re-scaling gradients based on domain geometry.

Findings

01

Achieve optimal convergence rate for smooth convex optimization.

02

Effectively adapt to the geometry of the domain.

03

Show superior empirical performance in data fitting tasks.

Abstract

We consider first order gradient methods for effectively optimizing a composite objective in the form of a sum of smooth and, potentially, non-smooth functions. We present accelerated and adaptive gradient methods, called FLAG and FLARE, which can offer the best of both worlds. They can achieve the optimal convergence rate by attaining the optimal first-order oracle complexity for smooth convex optimization. Additionally, they can adaptively and non-uniformly re-scale the gradient direction to adapt to the limited curvature available and conform to the geometry of the domain. We show theoretically and empirically that, through the compounding effects of acceleration and adaptivity, FLAG and FLARE can be highly effective for many data fitting and machine learning applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Medical Image Segmentation Techniques