A Universally Optimal Multistage Accelerated Stochastic Gradient Method

Necdet Serhat Aybat; Alireza Fallah; Mert Gurbuzbalaban; Asuman; Ozdaglar

arXiv:1901.08022·math.OC·October 29, 2019·20 cites

A Universally Optimal Multistage Accelerated Stochastic Gradient Method

Necdet Serhat Aybat, Alireza Fallah, Mert Gurbuzbalaban, Asuman, Ozdaglar

PDF

Open Access

TL;DR

This paper introduces a multistage accelerated stochastic gradient method that achieves optimal convergence rates in both deterministic and stochastic settings without prior noise knowledge.

Contribution

A novel multistage accelerated algorithm that is universally optimal, combining stochastic Nesterov's method with specific restarts and parameter tuning.

Findings

01

Achieves optimal convergence rates in both deterministic and stochastic cases.

02

Operates without prior knowledge of noise characteristics.

03

Uses staged stochastic Nesterov's method with tailored restarts.

Abstract

We study the problem of minimizing a strongly convex, smooth function when we have noisy estimates of its gradient. We propose a novel multistage accelerated algorithm that is universally optimal in the sense that it achieves the optimal rate both in the deterministic and stochastic case and operates without knowledge of noise characteristics. The algorithm consists of stages that use a stochastic version of Nesterov's method with a specific restart and parameters selected to achieve the fastest reduction in the bias-variance terms in the convergence rate bounds.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference