On the Saturation Phenomenon of Stochastic Gradient Descent for Linear Inverse Problems

Bangti Jin; Zehui Zhou; Jun Zou

arXiv:2010.10916 · math.OC · August 10, 2021

Open Access

TL;DR

This paper refines the understanding of stochastic gradient descent (SGD) for linear inverse problems, showing that saturation in convergence rates can be avoided with a small enough initial stepsize, supported by theoretical analysis and experiments.

Contribution

It provides a refined convergence rate analysis of SGD, demonstrating that saturation does not occur if the initial stepsize is sufficiently small.

Findings

1. The saturation phenomenon can be avoided by choosing a sufficiently small initial stepsize.
2. Refined convergence rates are established for SGD with a polynomially decaying stepsize schedule.
3. Numerical experiments complement the theoretical results.

Abstract

Stochastic gradient descent (SGD) is a promising method for solving large-scale inverse problems, due to its excellent scalability with respect to data size. The current mathematical theory, viewed through the lens of regularization theory, predicts that SGD with a polynomially decaying stepsize schedule may suffer from an undesirable saturation phenomenon, i.e., the convergence rate does not improve further with the solution regularity index once that index exceeds a certain range. In this work, we present a refined convergence rate analysis of SGD, and prove that saturation actually does not occur if the initial stepsize of the schedule is sufficiently small. Several numerical experiments are provided to complement the analysis.
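
To make the setting concrete, the following is a minimal sketch of SGD with a polynomially decaying stepsize applied to a finite-dimensional linear system. The abstract does not spell out the iteration, so everything here is an assumption for illustration: the least-squares formulation, the row-sampling update, the schedule `eta_k = eta0 * k**(-alpha)`, and the names `sgd_linear_inverse`, `eta0`, and `alpha`. It is not the authors' code.

```python
import numpy as np


def sgd_linear_inverse(A, y, eta0, alpha, n_iters, x0=None, seed=0):
    """Illustrative SGD for min_x (1/2n) * ||A x - y||^2 (assumed formulation).

    At step k (1-based) a row index i is drawn uniformly at random and
        x <- x - eta_k * (a_i . x - y_i) * a_i,
    with the polynomially decaying stepsize eta_k = eta0 * k**(-alpha).
    """
    rng = np.random.default_rng(seed)
    n, m = A.shape
    x = np.zeros(m) if x0 is None else np.asarray(x0, dtype=float).copy()
    for k in range(1, n_iters + 1):
        i = rng.integers(n)                # sample one equation uniformly
        eta_k = eta0 * k ** (-alpha)       # polynomially decaying stepsize
        residual = A[i] @ x - y[i]         # scalar residual of equation i
        x -= eta_k * residual * A[i]       # stochastic gradient step
    return x


# Hypothetical toy problem: exact data, rows normalized to unit length.
rng = np.random.default_rng(1)
A = rng.standard_normal((50, 5))
A /= np.linalg.norm(A, axis=1, keepdims=True)
x_true = np.ones(5)
y = A @ x_true

x_hat = sgd_linear_inverse(A, y, eta0=0.5, alpha=0.1, n_iters=5000)
print("relative error:", np.linalg.norm(x_hat - x_true) / np.linalg.norm(x_true))
```

In this sketch `eta0` plays the role of the initial stepsize that the abstract identifies as decisive for avoiding saturation. The toy problem is well conditioned and noise free, so it only illustrates the iteration and does not reproduce the paper's experiments or its convergence rates.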

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Numerical methods in inverse problems