Faster Rates for the Frank-Wolfe Method over Strongly-Convex Sets
Dan Garber, Elad Hazan

TL;DR
This paper proves that the Frank-Wolfe method achieves a quadratic convergence rate of 1/t^2 over strongly convex sets, significantly improving upon the standard 1/t rate, with applications to various norm balls.
Contribution
It establishes a faster convergence rate of 1/t^2 for the Frank-Wolfe method on strongly convex sets, a notable improvement over the general case.
Findings
Frank-Wolfe converges at rate 1/t^2 over strongly convex sets.
Linear optimization over these sets has closed-form solutions.
Previous fast-rate results follow from this analysis.
Abstract
The Frank-Wolfe method (a.k.a. conditional gradient algorithm) for smooth optimization has regained much interest in recent years in the context of large scale optimization and machine learning. A key advantage of the method is that it avoids projections - the computational bottleneck in many applications - replacing it by a linear optimization step. Despite this advantage, the known convergence rates of the FW method fall behind standard first order methods for most settings of interest. It is an active line of research to derive faster linear optimization-based algorithms for various settings of convex optimization. In this paper we consider the special case of optimization over strongly convex sets, for which we prove that the vanila FW method converges at a rate of . This gives a quadratic improvement in convergence rate compared to the general case, in which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
