On the linear convergence of the stochastic gradient method with constant step-size
Volkan Cevher, Bang Cong Vu

TL;DR
This paper establishes a weaker necessary condition than the strong growth condition for the linear convergence of stochastic gradient methods with constant step-size, and analyzes convergence behavior under perturbations.
Contribution
It introduces a weaker necessary condition for linear convergence of SGM-CS and studies convergence under additive perturbations for PSGM-CS and proximal stochastic gradient methods.
Findings
Necessary condition for linear convergence weaker than SGC.
Linear convergence to a noise-dominated region under perturbations.
Distance to optimal solution proportional to step-size and noise level.
Abstract
The strong growth condition (SGC) is known to be a sufficient condition for linear convergence of the stochastic gradient method using a constant step-size (SGM-CS). In this paper, we provide a necessary condition, for the linear convergence of SGM-CS, that is weaker than SGC. Moreover, when this necessary is violated up to a additive perturbation , we show that both the projected stochastic gradient method using a constant step-size (PSGM-CS) and the proximal stochastic gradient method exhibit linear convergence to a noise dominated region, whose distance to the optimal solution is proportional to .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Sparse and Compressive Sensing Techniques
