Local Quadratic Convergence of Stochastic Gradient Descent with Adaptive   Step Size

Adityanarayanan Radhakrishnan; Mikhail Belkin; Caroline Uhler

arXiv:2112.14872·math.OC·January 3, 2022

Local Quadratic Convergence of Stochastic Gradient Descent with Adaptive Step Size

Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

PDF

Open Access

TL;DR

This paper proves that stochastic gradient descent with adaptive step size can achieve local quadratic convergence for certain problems, enhancing understanding of its efficiency in practical machine learning tasks.

Contribution

It is the first to establish local quadratic convergence of adaptive stochastic gradient descent methods for specific problems like matrix inversion.

Findings

01

SGD with adaptive step size achieves local quadratic convergence.

02

Theoretical results apply to problems such as matrix inversion.

03

Enhances understanding of convergence behavior in adaptive stochastic optimization.

Abstract

Establishing a fast rate of convergence for optimization methods is crucial to their applicability in practice. With the increasing popularity of deep learning over the past decade, stochastic gradient descent and its adaptive variants (e.g. Adagrad, Adam, etc.) have become prominent methods of choice for machine learning practitioners. While a large number of works have demonstrated that these first order optimization methods can achieve sub-linear or linear convergence, we establish local quadratic convergence for stochastic gradient descent with adaptive step size for problems such as matrix inversion.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM

MethodsAdam