Estimate Sequences for Variance-Reduced Stochastic Composite   Optimization

Andrei Kulunchakov (Thoth); Julien Mairal (Thoth)

arXiv:1905.02374·stat.ML·May 8, 2019·20 cites

Estimate Sequences for Variance-Reduced Stochastic Composite Optimization

Andrei Kulunchakov (Thoth), Julien Mairal (Thoth)

PDF

Open Access

TL;DR

This paper introduces a unified framework for analyzing and developing stochastic gradient algorithms for convex composite optimization, extending Nesterov's estimate sequences to improve convergence proofs, adaptivity, robustness, and acceleration.

Contribution

It extends the concept of estimate sequences to stochastic composite optimization, providing a unified analysis, new algorithms, and strategies for robustness and acceleration.

Findings

01

Provided a generic convergence proof for stochastic gradient methods.

02

Developed an adaptive SVRG variant for strong convexity.

03

Derived new accelerated algorithms based on the estimate sequence framework.

Abstract

In this paper, we propose a unified view of gradient-based algorithms for stochastic convex composite optimization by extending the concept of estimate sequence introduced by Nesterov. This point of view covers the stochastic gradient descent method, variants of the approaches SAGA, SVRG, and has several advantages: (i) we provide a generic proof of convergence for the aforementioned methods; (ii) we show that this SVRG variant is adaptive to strong convexity; (iii) we naturally obtain new algorithms with the same guarantees; (iv) we derive generic strategies to make these algorithms robust to stochastic noise, which is useful when data is corrupted by small random perturbations. Finally, we show that this viewpoint is useful to obtain new accelerated algorithms in the sense of Nesterov.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods

MethodsSAGA