A Multistep Frank-Wolfe Method
Zhaoyue Chen, Yifan Sun

TL;DR
This paper introduces multistep variants of the Frank-Wolfe algorithm to address its slow convergence due to zig-zagging, analyzing their stability and convergence properties.
Contribution
It proposes multistep Frank-Wolfe methods with improved stability and analyzes their convergence, highlighting limitations of Runge-Kutta schemes.
Findings
Multistep Frank-Wolfe variants stabilize the algorithm.
Runge-Kutta schemes do not outperform vanilla Frank-Wolfe in worst-case convergence.
Analysis provides insights into flow analysis for optimization methods.
Abstract
The Frank-Wolfe algorithm has regained much interest in its use in structurally constrained machine learning applications. However, one major limitation of the Frank-Wolfe algorithm is the slow local convergence property due to the zig-zagging behavior. We observe the zig-zagging phenomenon in the Frank-Wolfe method as an artifact of discretization, and propose multistep Frank-Wolfe variants where the truncation errors decay as , where is the method's order. This strategy "stabilizes" the method, and allows tools like line search and momentum to have more benefits. However, our results suggest that the worst case convergence rate of Runge-Kutta-type discretization schemes cannot improve upon that of the vanilla Frank-Wolfe method for a rate depending on . Still, we believe that this analysis adds to the growing knowledge of flow analysis for optimization methods, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms
