FLAG n' FLARE: Fast Linearly-Coupled Adaptive Gradient Methods
Xiang Cheng, Farbod Roosta-Khorasani, Stefan Palombo, Peter L., Bartlett, Michael W. Mahoney

TL;DR
FLAG and FLARE are accelerated, adaptive gradient methods that optimize composite objectives efficiently by combining the best features of acceleration and adaptivity, suitable for various machine learning tasks.
Contribution
Introduction of FLAG and FLARE, novel gradient methods that achieve optimal convergence rates while adaptively re-scaling gradients based on domain geometry.
Findings
Achieve optimal convergence rate for smooth convex optimization.
Effectively adapt to the geometry of the domain.
Show superior empirical performance in data fitting tasks.
Abstract
We consider first order gradient methods for effectively optimizing a composite objective in the form of a sum of smooth and, potentially, non-smooth functions. We present accelerated and adaptive gradient methods, called FLAG and FLARE, which can offer the best of both worlds. They can achieve the optimal convergence rate by attaining the optimal first-order oracle complexity for smooth convex optimization. Additionally, they can adaptively and non-uniformly re-scale the gradient direction to adapt to the limited curvature available and conform to the geometry of the domain. We show theoretically and empirically that, through the compounding effects of acceleration and adaptivity, FLAG and FLARE can be highly effective for many data fitting and machine learning applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Medical Image Segmentation Techniques
