Stochastic Approximation Beyond Gradient for Signal Processing and   Machine Learning

Aymeric Dieuleveut; Gersende Fort; Eric Moulines; Hoi-To Wai

arXiv:2302.11147·math.OC·July 18, 2023·IEEE Trans. Signal Process.

Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Aymeric Dieuleveut, Gersende Fort, Eric Moulines, Hoi-To Wai

PDF

TL;DR

This paper broadens the scope of stochastic approximation (SA) beyond traditional gradient methods, providing a unified theoretical framework for non-gradient algorithms in signal processing and machine learning, with convergence analysis and practical applications.

Contribution

It introduces a general framework unifying theories of SA, including non-gradient algorithms, supported by Lyapunov function analysis, and demonstrates their convergence properties and applications.

Findings

01

Unified convergence analysis for non-gradient SA algorithms.

02

Connections established between Lyapunov functions and algorithm properties.

03

Extensions discussed for variance reduction techniques.

Abstract

Stochastic Approximation (SA) is a classical algorithm that has had since the early days a huge impact on signal processing, and nowadays on machine learning, due to the necessity to deal with a large amount of data observed with uncertainties. An exemplar special case of SA pertains to the popular stochastic (sub)gradient algorithm which is the working horse behind many important applications. A lesser-known fact is that the SA scheme also extends to non-stochastic-gradient algorithms such as compressed stochastic gradient, stochastic expectation-maximization, and a number of reinforcement learning algorithms. The aim of this article is to overview and introduce the non-stochastic-gradient perspectives of SA to the signal processing and machine learning audiences through presenting a design guideline of SA algorithms backed by theories. Our central theme is to propose a general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.