Stateful active facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu, Vedant Shah, Oussama Boussif, Cristian Meo, Anirudh Goyal,, Tianmin Shu, Michael Mozer, Nicolas Heess, Yoshua Bengio

TL;DR
This paper introduces HECOGrid, a suite of multi-agent RL environments with controllable heterogeneity and coordination levels, and proposes SAF, a novel training approach that outperforms baselines in diverse environments.
Contribution
The paper formalizes environment heterogeneity and coordination levels, and presents HECOGrid and SAF, a new learning method for efficient multi-agent cooperation in varied settings.
Findings
SAF outperforms IPPO and MAPPO across tasks
HECOGrid enables controlled evaluation of heterogeneity and coordination
SAF effectively handles high heterogeneity and coordination levels
Abstract
In cooperative multi-agent reinforcement learning, a team of agents works together to achieve a common goal. Different environments or tasks may require varying degrees of coordination among agents in order to achieve the goal in an optimal way. The nature of coordination will depend on the properties of the environment -- its spatial layout, distribution of obstacles, dynamics, etc. We term this variation of properties within an environment as heterogeneity. Existing literature has not sufficiently addressed the fact that different environments may have different levels of heterogeneity. We formalize the notions of coordination level and heterogeneity level of an environment and present HECOGrid, a suite of multi-agent RL environments that facilitates empirical evaluation of different MARL approaches across different levels of coordination and environmental heterogeneity by providing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Distributed and Parallel Computing Systems · Smart Grid Energy Management
