Boosting Frank-Wolfe by Chasing Gradients
Cyrille W. Combettes, Sebastian Pokutta

TL;DR
This paper introduces an enhanced Frank-Wolfe algorithm that accelerates convergence by better aligning descent directions with the negative gradient, achieving faster rates and improved computational efficiency.
Contribution
The paper proposes a novel subroutine that chases the negative gradient to speed up Frank-Wolfe while maintaining its projection-free nature.
Findings
Convergence rate improved from O(1/t) to exponential decay.
Method outperforms state-of-the-art in CPU time and iteration efficiency.
Significant practical speedups demonstrated in experiments.
Abstract
The Frank-Wolfe algorithm has become a popular first-order optimization algorithm for it is simple and projection-free, and it has been successfully applied to a variety of real-world problems. Its main drawback however lies in its convergence rate, which can be excessively slow due to naive descent directions. We propose to speed up the Frank-Wolfe algorithm by better aligning the descent direction with that of the negative gradient via a subroutine. This subroutine chases the negative gradient direction in a matching pursuit-style while still preserving the projection-free property. Although the approach is reasonably natural, it produces very significant results. We derive convergence rates to of our method and we demonstrate its competitive advantage both per iteration and in CPU time over the state-of-the-art in a series of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
