An Empirical Study on Google Research Football Multi-agent Scenarios

Yan Song; He Jiang; Zheng Tian; Haifeng Zhang; Yingping Zhang,; Jiangcheng Zhu; Zonghong Dai; Weinan Zhang; Jun Wang,

arXiv:2305.09458·cs.LG·February 21, 2024·1 cites

An Empirical Study on Google Research Football Multi-agent Scenarios

Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang,, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang,

PDF

Open Access 1 Repo

TL;DR

This paper presents a comprehensive empirical study on multi-agent reinforcement learning in Google Research Football, introducing a new training pipeline, open-source framework, and benchmarks to advance research in full-game scenarios.

Contribution

It provides a population-based training pipeline, open-source framework, and benchmark results for multi-agent football scenarios, filling a gap in existing research and tools.

Findings

01

Outperforms difficulty 1.0 bots within 2 million steps

02

Provides a reference for Independent Proximal Policy Optimization (IPPO) performance

03

Open-sources training framework Light-MALib with analytical tools

Abstract

Few multi-agent reinforcement learning (MARL) research on Google Research Football (GRF) focus on the 11v11 multi-agent full-game scenario and to the best of our knowledge, no open benchmark on this scenario has been released to the public. In this work, we fill the gap by providing a population-based MARL training pipeline and hyperparameter settings on multi-agent football scenario that outperforms the bot with difficulty 1.0 from scratch within 2 million steps. Our experiments serve as a reference for the expected performance of Independent Proximal Policy Optimization (IPPO), a state-of-the-art multi-agent reinforcement learning algorithm where each agent tries to maximize its own policy independently across various training configurations. Meanwhile, we open-source our training framework Light-MALib which extends the MALib codebase by distributed and asynchronized implementation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shanghai-digital-brain-laboratory/db-football
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Games and Gamification · Reinforcement Learning in Robotics · Data Stream Mining Techniques

MethodsSix Ways To Communicate To Someone At Expedia Via Phone And Email's. · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · 1x1 Convolution · Feedforward Network · Two Time-scale Update Rule · Projection Discriminator · Non-Local Operation · Adam · Non-Local Block