SpiderBoost and Momentum: Faster Stochastic Variance Reduction Algorithms
Zhe Wang, Kaiyi Ji, Yi Zhou, Yingbin Liang, Vahid Tarokh

TL;DR
This paper introduces SpiderBoost, an improved stochastic variance reduction algorithm that uses larger stepsizes, handles nonsmooth regularizers, and accelerates convergence in nonconvex optimization, with theoretical guarantees and practical improvements.
Contribution
SpiderBoost extends SPIDER with larger stepsizes and proximal mapping, achieving near-optimal complexity and improved practical performance in nonsmooth nonconvex optimization.
Findings
SpiderBoost achieves an oracle complexity of O(min{n^{1/2}ε^{-2}, ε^{-3}}).
Proposed momentum scheme accelerates convergence in experiments.
Improves state-of-the-art complexity bounds for composite nonconvex optimization.
Abstract
SARAH and SPIDER are two recently developed stochastic variance-reduced algorithms, and SPIDER has been shown to achieve a near-optimal first-order oracle complexity in smooth nonconvex optimization. However, SPIDER uses an accuracy-dependent stepsize that slows down the convergence in practice, and cannot handle objective functions that involve nonsmooth regularizers. In this paper, we propose SpiderBoost as an improved scheme, which allows to use a much larger constant-level stepsize while maintaining the same near-optimal oracle complexity, and can be extended with proximal mapping to handle composite optimization (which is nonsmooth and nonconvex) with provable convergence guarantee. In particular, we show that proximal SpiderBoost achieves an oracle complexity of in composite nonconvex optimization, improving the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Generative Adversarial Networks and Image Synthesis
