ASVRG: Accelerated Proximal SVRG

Fanhua Shang; Licheng Jiao; Kaiwen Zhou; James Cheng; Yan Ren; Yufei; Jin

arXiv:1810.03105·cs.LG·November 20, 2018·21 cites

ASVRG: Accelerated Proximal SVRG

Fanhua Shang, Licheng Jiao, Kaiwen Zhou, James Cheng, Yan Ren, Yufei, Jin

PDF

Open Access

TL;DR

This paper introduces ASVRG, an accelerated proximal stochastic variance reduction method with a simple design, achieving optimal complexity and extending to mini-batch and non-smooth problems, with empirical results supporting its effectiveness.

Contribution

ASVRG presents a simpler, more efficient acceleration technique for stochastic variance reduction, requiring fewer variables and lower complexity than existing methods.

Findings

01

Achieves optimal oracle complexities for convex and non-convex objectives.

02

Extends to mini-batch and non-smooth optimization problems.

03

Empirically outperforms or matches state-of-the-art stochastic methods.

Abstract

This paper proposes an accelerated proximal stochastic variance reduced gradient (ASVRG) method, in which we design a simple and effective momentum acceleration trick. Unlike most existing accelerated stochastic variance reduction methods such as Katyusha, ASVRG has only one additional variable and one momentum parameter. Thus, ASVRG is much simpler than those methods, and has much lower per-iteration complexity. We prove that ASVRG achieves the best known oracle complexities for both strongly convex and non-strongly convex objectives. In addition, we extend ASVRG to mini-batch and non-smooth settings. We also empirically verify our theoretical results and show that the performance of ASVRG is comparable with, and sometimes even better than that of the state-of-the-art stochastic methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning