Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
Jiayu Chen, Zelai Xu, Yunfei Li, Chao Yu, Jiaming Song, Huazhong Yang,, Fei Fang, Yu Wang, Yi Wu

TL;DR
This paper introduces SACL, a novel subgame curriculum learning framework that accelerates multi-agent reinforcement learning in zero-sum games by adaptively selecting subgames, leading to faster convergence and stronger policies.
Contribution
The paper proposes a new subgame curriculum learning method, SACL, which uses an adaptive initial state distribution and a subgame selection metric to improve MARL efficiency in zero-sum games.
Findings
SACL outperforms baseline algorithms in particle-world and Google Research Football environments.
SACL achieves emergent stages and reduces sample complexity in hide-and-seek quadrants.
SACL enhances policy strength and learning speed in complex zero-sum games.
Abstract
Learning Nash equilibrium (NE) in complex zero-sum games with multi-agent reinforcement learning (MARL) can be extremely computationally expensive. Curriculum learning is an effective way to accelerate learning, but an under-explored dimension for generating a curriculum is the difficulty-to-learn of the subgames -- games induced by starting from a specific state. In this work, we present a novel subgame curriculum learning framework for zero-sum games. It adopts an adaptive initial state distribution by resetting agents to some previously visited states where they can quickly learn to improve performance. Building upon this framework, we derive a subgame selection metric that approximates the squared distance to NE values and further adopt a particle-based state sampler for subgame generation. Integrating these techniques leads to our new algorithm, Subgame Automatic Curriculum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsArtificial Intelligence in Games · Sports Analytics and Performance · Reinforcement Learning in Robotics
