Concentration bounds for two time scale stochastic approximation

Vivek S. Borkar; Sarath Pattathil

arXiv:1806.10798·math.OC·June 29, 2018·Allerton

Concentration bounds for two time scale stochastic approximation

Vivek S. Borkar, Sarath Pattathil

PDF

TL;DR

This paper derives a high-probability concentration bound for two time scale stochastic approximation algorithms by modeling them as discretizations of singularly perturbed differential equations, extending single time scale results.

Contribution

It introduces a novel concentration bound for two time scale stochastic approximation using Alekseev's formula and martingale inequalities, expanding the theoretical understanding of these algorithms.

Findings

01

Provides a quantifiable high-probability bound for two time scale stochastic approximation.

02

Extends existing results from single to two time scale stochastic approximation.

03

Uses advanced mathematical tools like Alekseev's formula and martingale concentration inequalities.

Abstract

Viewing a two time scale stochastic approximation scheme as a noisy discretization of a singularly perturbed differential equation, we obtain a concentration bound for its iterates that captures its behavior with quantifiable high probability. This uses Alekseev's nonlinear variation of constants formula and a martingale concentration inequality and extends the corresponding results for single time scale stochastic approximation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.