Non-Asymptotic Analysis of Stochastic Approximation Algorithms for   Streaming Data

Antoine Godichon-Baggioni (LPSM (UMR\_8001)); Nicklas Werge (LPSM; (UMR\_8001)); Olivier Wintenberger (LPSM (UMR\_8001))

arXiv:2109.07117·cs.LG·April 25, 2023·1 cites

Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Streaming Data

Antoine Godichon-Baggioni (LPSM (UMR\_8001)), Nicklas Werge (LPSM, (UMR\_8001)), Olivier Wintenberger (LPSM (UMR\_8001))

PDF

Open Access

TL;DR

This paper develops a non-asymptotic analysis framework for stochastic approximation algorithms in streaming data settings, showing how to accelerate convergence and reduce variance using time-varying mini-batches and averaging techniques.

Contribution

It introduces a novel streaming framework with non-asymptotic convergence rates and demonstrates how to optimize learning rates and averaging for faster, more accurate online learning.

Findings

01

Polyak-Ruppert averaging achieves optimal convergence bounds.

02

Time-varying mini-batches combined with averaging improve variance reduction.

03

Accelerated convergence through adaptive learning rate selection.

Abstract

We introduce a streaming framework for analyzing stochastic approximation/optimization problems. This streaming framework is analogous to solving optimization problems using time-varying mini-batches that arrive sequentially. We provide non-asymptotic convergence rates of various gradient-based algorithms; this includes the famous Stochastic Gradient (SG) descent (a.k.a. Robbins-Monro algorithm), mini-batch SG and time-varying mini-batch SG algorithms, as well as their iterated averages (a.k.a. Polyak-Ruppert averaging). We show i) how to accelerate convergence by choosing the learning rate according to the time-varying mini-batches, ii) that Polyak-Ruppert averaging achieves optimal convergence in terms of attaining the Cramer-Rao lower bound, and iii) how time-varying mini-batches together with Polyak-Ruppert averaging can provide variance reduction and accelerate convergence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Advanced Bandit Algorithms Research