Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games
Sihan Zeng, Thinh T. Doan

TL;DR
This paper introduces an accelerated multi-time-scale stochastic approximation algorithm with improved convergence rates, leveraging auxiliary variables to better estimate operators and decouple noise, applicable to reinforcement learning and multi-agent games.
Contribution
It develops a novel accelerated algorithm for multi-time-scale stochastic approximation with optimal convergence, and demonstrates its application to reinforcement learning and multi-agent systems.
Findings
Achieves $ ilde{O}(1/t)$ convergence rate under strong monotonicity.
Effectively controls variance of operator estimates.
Shows improved performance in multi-agent game simulations.
Abstract
Multi-time-scale stochastic approximation is an iterative algorithm for finding the fixed point of a set of coupled operators given their noisy samples. It has been observed that due to the coupling between the decision variables and noisy samples of the operators, the performance of this method decays as increases. In this work, we develop a new accelerated variant of multi-time-scale stochastic approximation, which significantly improves the convergence rates of its standard counterpart. Our key idea is to introduce auxiliary variables to dynamically estimate the operators from their samples, which are then used to update the decision variables. These auxiliary variables help not only to control the variance of the operator estimates but also to decouple the sampling noise and the decision variables. This allows us to select more aggressive step sizes to achieve an optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications
