Neural Auto-Curricula
Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying, Wen, Jun Wang, Yaodong Yang

TL;DR
Neural Auto-Curricula (NAC) uses meta-gradient descent to automatically learn opponent selection and response strategies in multi-agent reinforcement learning, outperforming traditional game-theoretic methods on various games without human-designed rules.
Contribution
NAC introduces a neural network-based framework that automates the discovery of opponent selection and response strategies in MARL, eliminating the need for manual game-theoretic design.
Findings
NAC achieves competitive or better performance than state-of-the-art methods.
NAC generalizes from small to large games, outperforming existing algorithms.
NAC discovers effective strategies without explicit human-designed rules.
Abstract
When solving two-player zero-sum games, multi-agent reinforcement learning (MARL) algorithms often create populations of agents where, at each iteration, a new agent is discovered as the best response to a mixture over the opponent population. Within such a process, the update rules of "who to compete with" (i.e., the opponent mixture) and "how to beat them" (i.e., finding best responses) are underpinned by manually developed game theoretical principles such as fictitious play and Double Oracle. In this paper, we introduce a novel framework -- Neural Auto-Curricula (NAC) -- that leverages meta-gradient descent to automate the discovery of the learning update rule without explicit human design. Specifically, we parameterise the opponent selection module by neural networks and the best-response module by optimisation subroutines, and update their parameters solely via interaction with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Artificial Intelligence in Games · Reinforcement Learning in Robotics
