Neural Auto-Curricula

Xidong Feng; Oliver Slumbers; Ziyu Wan; Bo Liu; Stephen McAleer; Ying; Wen; Jun Wang; Yaodong Yang

arXiv:2106.02745·cs.AI·November 2, 2021

Neural Auto-Curricula

Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying, Wen, Jun Wang, Yaodong Yang

PDF

Open Access 1 Repo

TL;DR

Neural Auto-Curricula (NAC) uses meta-gradient descent to automatically learn opponent selection and response strategies in multi-agent reinforcement learning, outperforming traditional game-theoretic methods on various games without human-designed rules.

Contribution

NAC introduces a neural network-based framework that automates the discovery of opponent selection and response strategies in MARL, eliminating the need for manual game-theoretic design.

Findings

01

NAC achieves competitive or better performance than state-of-the-art methods.

02

NAC generalizes from small to large games, outperforming existing algorithms.

03

NAC discovers effective strategies without explicit human-designed rules.

Abstract

When solving two-player zero-sum games, multi-agent reinforcement learning (MARL) algorithms often create populations of agents where, at each iteration, a new agent is discovered as the best response to a mixture over the opponent population. Within such a process, the update rules of "who to compete with" (i.e., the opponent mixture) and "how to beat them" (i.e., finding best responses) are underpinned by manually developed game theoretical principles such as fictitious play and Double Oracle. In this paper, we introduce a novel framework -- Neural Auto-Curricula (NAC) -- that leverages meta-gradient descent to automate the discovery of the learning update rule without explicit human design. Specifically, we parameterise the opponent selection module by neural networks and the best-response module by optimisation subroutines, and update their parameters solely via interaction with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

waterhorse1/nac
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSports Analytics and Performance · Artificial Intelligence in Games · Reinforcement Learning in Robotics