Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games

Sihan Zeng; Thinh T. Doan

arXiv:2409.07767·math.OC·September 13, 2024

TL;DR

This paper introduces an accelerated multi-time-scale stochastic approximation algorithm with improved convergence rates. The algorithm uses auxiliary variables to estimate the operators from their samples and to decouple the sampling noise from the decision variables, with applications to reinforcement learning and multi-agent games.

Contribution

The paper develops an accelerated algorithm for multi-time-scale stochastic approximation with optimal complexity and demonstrates its application to reinforcement learning and multi-agent games.

Findings

01

Achieves an $\tilde{O}(1/t)$ convergence rate under strong monotonicity.

02

Effectively controls variance of operator estimates.

03

Shows improved performance in multi-agent game simulations.

Abstract

Multi-time-scale stochastic approximation is an iterative algorithm for finding the fixed point of a set of $N$ coupled operators given their noisy samples. It has been observed that due to the coupling between the decision variables and noisy samples of the operators, the performance of this method decays as $N$ increases. In this work, we develop a new accelerated variant of multi-time-scale stochastic approximation, which significantly improves the convergence rates of its standard counterpart. Our key idea is to introduce auxiliary variables to dynamically estimate the operators from their samples, which are then used to update the decision variables. These auxiliary variables help not only to control the variance of the operator estimates but also to decouple the sampling noise and the decision variables. This allows us to select more aggressive step sizes to achieve an optimal…
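
The key idea described in the abstract (auxiliary variables that track the operators from noisy samples and are then used to update the decision variables) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the two toy linear operators, the names `operators`, `accelerated_sa_sketch`, and `f_hat`, and the step-size schedules `lam` and `alpha`.

```python
import numpy as np


def operators(theta):
    """Two coupled, strongly monotone linear operators (toy example, assumed).

    Their common root is theta* = (1.6, 1.2).
    """
    x, y = theta
    return np.array([x - 0.5 * y - 1.0, y + 0.5 * x - 2.0])


def accelerated_sa_sketch(num_iters=20000, noise_std=1.0, seed=0):
    """Stochastic approximation with auxiliary operator estimates (sketch)."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(2)  # decision variables (x, y)
    f_hat = np.zeros(2)  # auxiliary variables: running estimates of the operators

    for k in range(num_iters):
        # Illustrative schedules (assumed, not the paper's choices).
        lam = 10.0 / (k + 20)                    # averaging weight for f_hat
        alpha = np.array([2.0, 4.0]) / (k + 20)  # one step size per variable

        # Noisy sample of the operators at the current iterate.
        sample = operators(theta) + noise_std * rng.standard_normal(2)

        # The decision variables move along the smoothed estimate,
        # not along the raw noisy sample.
        theta_next = theta - alpha * f_hat

        # The auxiliary variables average the samples over time.
        f_hat = (1.0 - lam) * f_hat + lam * sample

        theta = theta_next

    return theta


if __name__ == "__main__":
    print("estimate:", accelerated_sa_sketch())
    print("target:  ", np.array([1.6, 1.2]))
```

In this sketch the raw sample only enters through `f_hat`, so the noise reaches the decision variables after averaging. A standard multi-time-scale update would instead use `theta - alpha * sample` directly.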

Taxonomy

Topics: Stochastic processes and financial applications