Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Streaming Data
Antoine Godichon-Baggioni (LPSM (UMR\_8001)), Nicklas Werge (LPSM, (UMR\_8001)), Olivier Wintenberger (LPSM (UMR\_8001))

TL;DR
This paper develops a non-asymptotic analysis framework for stochastic approximation algorithms in streaming data settings, showing how to accelerate convergence and reduce variance using time-varying mini-batches and averaging techniques.
Contribution
It introduces a novel streaming framework with non-asymptotic convergence rates and demonstrates how to optimize learning rates and averaging for faster, more accurate online learning.
Findings
Polyak-Ruppert averaging achieves optimal convergence bounds.
Time-varying mini-batches combined with averaging improve variance reduction.
Accelerated convergence through adaptive learning rate selection.
Abstract
We introduce a streaming framework for analyzing stochastic approximation/optimization problems. This streaming framework is analogous to solving optimization problems using time-varying mini-batches that arrive sequentially. We provide non-asymptotic convergence rates of various gradient-based algorithms; this includes the famous Stochastic Gradient (SG) descent (a.k.a. Robbins-Monro algorithm), mini-batch SG and time-varying mini-batch SG algorithms, as well as their iterated averages (a.k.a. Polyak-Ruppert averaging). We show i) how to accelerate convergence by choosing the learning rate according to the time-varying mini-batches, ii) that Polyak-Ruppert averaging achieves optimal convergence in terms of attaining the Cramer-Rao lower bound, and iii) how time-varying mini-batches together with Polyak-Ruppert averaging can provide variance reduction and accelerate convergence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Advanced Bandit Algorithms Research
