Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning
Aymeric Dieuleveut, Gersende Fort, Eric Moulines, Hoi-To Wai

TL;DR
This paper broadens the scope of stochastic approximation (SA) beyond traditional gradient methods, providing a unified theoretical framework for non-gradient algorithms in signal processing and machine learning, with convergence analysis and practical applications.
Contribution
It introduces a general framework unifying theories of SA, including non-gradient algorithms, supported by Lyapunov function analysis, and demonstrates their convergence properties and applications.
Findings
Unified convergence analysis for non-gradient SA algorithms.
Connections established between Lyapunov functions and algorithm properties.
Extensions discussed for variance reduction techniques.
Abstract
Stochastic Approximation (SA) is a classical algorithm that has had since the early days a huge impact on signal processing, and nowadays on machine learning, due to the necessity to deal with a large amount of data observed with uncertainties. An exemplar special case of SA pertains to the popular stochastic (sub)gradient algorithm which is the working horse behind many important applications. A lesser-known fact is that the SA scheme also extends to non-stochastic-gradient algorithms such as compressed stochastic gradient, stochastic expectation-maximization, and a number of reinforcement learning algorithms. The aim of this article is to overview and introduce the non-stochastic-gradient perspectives of SA to the signal processing and machine learning audiences through presenting a design guideline of SA algorithms backed by theories. Our central theme is to propose a general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
