Finite-Time Analysis of Projected Two-Time-Scale Stochastic Approximation

Yitao Bai; Thinh T. Doan; Justin Romberg

arXiv:2604.00179·eess.SY·April 2, 2026


TL;DR

This paper gives a finite-time convergence analysis of projected linear two-time-scale stochastic approximation with constant step sizes and Polyak–Ruppert averaging, deriving an explicit mean-square error bound and illustrating it on synthetic and reinforcement learning problems.

Contribution

The paper derives a finite-time mean-square error bound for projected two-time-scale stochastic approximation with explicit constants, and splits the bound into an approximation error set by the constrained subspace and a statistical error that decays at a sublinear rate.

Findings

01

Explicit mean-square error bounds are derived.

02

The constants are expressed through restricted stability margins and a coupling invertibility condition.

03

Numerical experiments on synthetic and reinforcement learning problems illustrate the theoretical results.

Abstract

We study the finite-time convergence of projected linear two-time-scale stochastic approximation with constant step sizes and Polyak–Ruppert averaging. We establish an explicit mean-square error bound, decomposing it into two interpretable components, an approximation error determined by the constrained subspace and a statistical error decaying at a sublinear rate, with constants expressed through restricted stability margins and a coupling invertibility condition. These constants cleanly separate the effect of subspace choice (approximation errors) from the effect of the averaging horizon (statistical errors). We illustrate our theoretical results through a number of numerical experiments on both synthetic and reinforcement learning problems.
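
To make the setting in the abstract concrete, the following is a minimal sketch of one possible projected linear two-time-scale iteration with constant step sizes and Polyak–Ruppert averaging. It is an illustration under stated assumptions, not the authors' algorithm or experiments: the update form, the toy matrices `A11`, `A12`, `A21`, `A22`, the vectors `b1`, `b2`, the subspaces, the step sizes `alpha` and `beta`, and the noise model are all hypothetical choices made here. The sketch compares the averaged iterate with the fixed point of the projected equations (a statistical error) and compares that fixed point with the unconstrained solution (an approximation error).

```python
import numpy as np


def projected_two_time_scale_sa(A11, A12, A21, A22, b1, b2, P_x, P_y,
                                alpha, beta, num_steps, noise_std, rng):
    """Return running (Polyak-Ruppert) averages of the projected iterates.

    Assumed update form (hypothetical, for illustration only):
        x <- P_x (x - alpha * (A11 x + A12 y - b1 + noise))
        y <- P_y (y - beta  * (A21 x + A22 y - b2 + noise))
    """
    x, y = np.zeros_like(b1), np.zeros_like(b2)
    x_avg, y_avg = np.zeros_like(b1), np.zeros_like(b2)
    for k in range(1, num_steps + 1):
        # Noisy residuals of the two coupled linear equations.
        g_x = A11 @ x + A12 @ y - b1 + noise_std * rng.standard_normal(b1.size)
        g_y = A21 @ x + A22 @ y - b2 + noise_std * rng.standard_normal(b2.size)
        # Constant-step updates, each followed by projection onto its subspace.
        x, y = P_x @ (x - alpha * g_x), P_y @ (y - beta * g_y)
        # Incremental running mean of the iterates.
        x_avg += (x - x_avg) / k
        y_avg += (y - y_avg) / k
    return x_avg, y_avg


rng = np.random.default_rng(0)
d = 4
eye = np.eye(d)

# Toy coupled system (assumed): the joint matrix J below is symmetric
# positive definite, so the iteration is stable for small step sizes.
A11, A12, A21, A22 = 2.0 * eye, 0.5 * eye, 0.5 * eye, 2.0 * eye
b1 = np.array([1.0, -1.0, 0.5, 2.0])
b2 = np.array([0.0, 1.0, -0.5, 1.0])

# Orthonormal bases of two hypothetical 2-dimensional constraint subspaces.
Q_x, _ = np.linalg.qr(np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]))
Q_y, _ = np.linalg.qr(np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]]))
P_x, P_y = Q_x @ Q_x.T, Q_y @ Q_y.T  # orthogonal projectors

# Unconstrained solution and the fixed point of the projected equations,
# the latter computed in reduced coordinates of the subspaces.
J = np.block([[A11, A12], [A21, A22]])
b = np.concatenate([b1, b2])
Q = np.block([[Q_x, np.zeros((d, 2))], [np.zeros((d, 2)), Q_y]])
z_star = np.linalg.solve(J, b)
z_proj = Q @ np.linalg.solve(Q.T @ J @ Q, Q.T @ b)

x_bar, y_bar = projected_two_time_scale_sa(
    A11, A12, A21, A22, b1, b2, P_x, P_y,
    alpha=0.05, beta=0.2, num_steps=20000, noise_std=0.1, rng=rng)
z_bar = np.concatenate([x_bar, y_bar])

# Distance of the average to the projected fixed point (shrinks with horizon)
# versus distance of that fixed point to the unconstrained solution (does not).
statistical_error = np.linalg.norm(z_bar - z_proj)
approximation_error = np.linalg.norm(z_proj - z_star)
print("statistical error:  ", statistical_error)
print("approximation error:", approximation_error)
```

In this toy setup, increasing `num_steps` should shrink only the first quantity, while changing the subspaces changes only the second, which mirrors the separation described in the abstract. The rate and constants of the paper's bound are not reproduced here.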
