An Empirical Study on Google Research Football Multi-agent Scenarios
Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang,, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang,

TL;DR
This paper presents a comprehensive empirical study on multi-agent reinforcement learning in Google Research Football, introducing a new training pipeline, open-source framework, and benchmarks to advance research in full-game scenarios.
Contribution
It provides a population-based training pipeline, open-source framework, and benchmark results for multi-agent football scenarios, filling a gap in existing research and tools.
Findings
Outperforms difficulty 1.0 bots within 2 million steps
Provides a reference for Independent Proximal Policy Optimization (IPPO) performance
Open-sources training framework Light-MALib with analytical tools
Abstract
Few multi-agent reinforcement learning (MARL) research on Google Research Football (GRF) focus on the 11v11 multi-agent full-game scenario and to the best of our knowledge, no open benchmark on this scenario has been released to the public. In this work, we fill the gap by providing a population-based MARL training pipeline and hyperparameter settings on multi-agent football scenario that outperforms the bot with difficulty 1.0 from scratch within 2 million steps. Our experiments serve as a reference for the expected performance of Independent Proximal Policy Optimization (IPPO), a state-of-the-art multi-agent reinforcement learning algorithm where each agent tries to maximize its own policy independently across various training configurations. Meanwhile, we open-source our training framework Light-MALib which extends the MALib codebase by distributed and asynchronized implementation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Games and Gamification · Reinforcement Learning in Robotics · Data Stream Mining Techniques
MethodsSix Ways To Communicate To Someone At Expedia Via Phone And Email's. · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · 1x1 Convolution · Feedforward Network · Two Time-scale Update Rule · Projection Discriminator · Non-Local Operation · Adam · Non-Local Block
