On the linear convergence of the stochastic gradient method with   constant step-size

Volkan Cevher; Bang Cong Vu

arXiv:1712.01906·math.OC·June 19, 2018·Optim. Lett.·1 cites

On the linear convergence of the stochastic gradient method with constant step-size

Volkan Cevher, Bang Cong Vu

PDF

Open Access

TL;DR

This paper establishes a weaker necessary condition than the strong growth condition for the linear convergence of stochastic gradient methods with constant step-size, and analyzes convergence behavior under perturbations.

Contribution

It introduces a weaker necessary condition for linear convergence of SGM-CS and studies convergence under additive perturbations for PSGM-CS and proximal stochastic gradient methods.

Findings

01

Necessary condition for linear convergence weaker than SGC.

02

Linear convergence to a noise-dominated region under perturbations.

03

Distance to optimal solution proportional to step-size and noise level.

Abstract

The strong growth condition (SGC) is known to be a sufficient condition for linear convergence of the stochastic gradient method using a constant step-size $γ$ (SGM-CS). In this paper, we provide a necessary condition, for the linear convergence of SGM-CS, that is weaker than SGC. Moreover, when this necessary is violated up to a additive perturbation $σ$ , we show that both the projected stochastic gradient method using a constant step-size (PSGM-CS) and the proximal stochastic gradient method exhibit linear convergence to a noise dominated region, whose distance to the optimal solution is proportional to $γ σ$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Sparse and Compressive Sensing Techniques