Asynchronous Gradient Play in Zero-Sum Multi-agent Games
Ruicheng Ao, Shicong Cen, Yuejie Chi

TL;DR
This paper investigates asynchronous gradient play in zero-sum multi-agent games with delayed feedbacks, establishing convergence properties and proposing methods for faster and robust equilibrium approximation.
Contribution
It introduces a delay-aware two-timescale learning approach for entropy-regularized gradient methods in zero-sum polymatrix games, with proven convergence under various delay conditions.
Findings
Last iterate of entropy-regularized OMWU converges linearly without delays.
Convergence persists under random delays with slower rates.
Two-timescale learning accelerates convergence under fixed delays.
Abstract
Finding equilibria via gradient play in competitive multi-agent games has been attracting a growing amount of attention in recent years, with emphasis on designing efficient strategies where the agents operate in a decentralized and symmetric manner with guaranteed convergence. While significant efforts have been made in understanding zero-sum two-player matrix games, the performance in zero-sum multi-agent games remains inadequately explored, especially in the presence of delayed feedbacks, leaving the scalability and resiliency of gradient play open to questions. In this paper, we make progress by studying asynchronous gradient plays in zero-sum polymatrix games under delayed feedbacks. We first establish that the last iterate of entropy-regularized optimistic multiplicative weight updates (OMWU) method converges linearly to the quantal response equilibrium (QRE), the solution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGame Theory and Applications · Mathematical and Theoretical Epidemiology and Ecology Models · Reinforcement Learning in Robotics
