Diversity is Strength: Mastering Football Full Game with Interactive Reinforcement Learning of Multiple AIs
Chenglu Sun, Shuo Shen, Sijia Xu, Weidong Zhang

TL;DR
This paper introduces a novel reinforcement learning framework called Diversity is Strength (DIS) that trains multiple AI agents with diverse strategies in complex multi-agent environments, leading to high-performance football AIs without human data.
Contribution
The paper proposes the DIS framework, which enhances AI diversity and strength through interconnected models and evaluation schemes, achieving state-of-the-art results in Google Research Football competitions.
Findings
Won 5v5 and 11v11 tracks in AI football competitions
AI exhibits rich, diverse strategies in complex multi-agent settings
Ablation studies confirm effectiveness of proposed modules
Abstract
Training AI with strong and rich strategies in multi-agent environments remains an important research topic in Deep Reinforcement Learning (DRL). The AI's strength is closely related to its diversity of strategies, and this relationship can guide us to train AI with both strong and rich strategies. To prove this point, we propose Diversity is Strength (DIS), a novel DRL training framework that can simultaneously train multiple kinds of AIs. These AIs are linked through an interconnected history model pool structure, which enhances their capabilities and strategy diversities. We also design a model evaluation and screening scheme to select the best models to enrich the model pool and obtain the final AI. The proposed training method provides diverse, generalizable, and strong AI strategies without using human data. We tested our method in an AI competition based on Google Research…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance · Reinforcement Learning in Robotics
MethodsSix Ways To Communicate To Someone At Expedia Via Phone And Email's. · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · 1x1 Convolution · Feedforward Network · Two Time-scale Update Rule · Projection Discriminator · Non-Local Operation · Adam · Non-Local Block
