Boosting Frank-Wolfe by Chasing Gradients

Cyrille W. Combettes; Sebastian Pokutta

arXiv:2003.06369·math.OC·June 25, 2020·6 cites

Boosting Frank-Wolfe by Chasing Gradients

Cyrille W. Combettes, Sebastian Pokutta

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces an enhanced Frank-Wolfe algorithm that accelerates convergence by better aligning descent directions with the negative gradient, achieving faster rates and improved computational efficiency.

Contribution

The paper proposes a novel subroutine that chases the negative gradient to speed up Frank-Wolfe while maintaining its projection-free nature.

Findings

01

Convergence rate improved from O(1/t) to exponential decay.

02

Method outperforms state-of-the-art in CPU time and iteration efficiency.

03

Significant practical speedups demonstrated in experiments.

Abstract

The Frank-Wolfe algorithm has become a popular first-order optimization algorithm for it is simple and projection-free, and it has been successfully applied to a variety of real-world problems. Its main drawback however lies in its convergence rate, which can be excessively slow due to naive descent directions. We propose to speed up the Frank-Wolfe algorithm by better aligning the descent direction with that of the negative gradient via a subroutine. This subroutine chases the negative gradient direction in a matching pursuit-style while still preserving the projection-free property. Although the approach is reasonably natural, it produces very significant results. We derive convergence rates $O (1/ t)$ to $O (e^{- ω t})$ of our method and we demonstrate its competitive advantage both per iteration and in CPU time over the state-of-the-art in a series of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cyrillewcombettes/boostfw
noneOfficial

Videos

Boosting Frank-Wolfe by Chasing Gradients· slideslive

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings