Linear Convergence of Stochastic Frank Wolfe Variants
Donald Goldfarb, Garud Iyengar, Chaoxu Zhou

TL;DR
This paper proves linear convergence in expectation and almost surely for stochastic Frank-Wolfe variants, introducing a novel proof technique and demonstrating competitive performance in large-scale experiments.
Contribution
It establishes the first linear convergence results for ASFW and PSFW algorithms using a new proof approach based on empirical processes.
Findings
ASFW and PSFW converge linearly in expectation.
Linear convergence implies almost sure convergence.
Algorithms perform well in large-scale numerical experiments.
Abstract
In this paper, we show that the Away-step Stochastic Frank-Wolfe Algorithm (ASFW) and Pairwise Stochastic Frank-Wolfe algorithm (PSFW) converge linearly in expectation. We also show that if an algorithm convergences linearly in expectation then it converges linearly almost surely. In order to prove these results, we develop a novel proof technique based on concepts of empirical processes and concentration inequalities. Such a technique has rarely been used to derive the convergence rates of stochastic optimization algorithms. In large-scale numerical experiments, ASFW and PSFW perform as well as or better than their stochastic competitors in actual CPU time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Neural Networks and Applications · Markov Chains and Monte Carlo Methods
