Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning

Sheng Li; Jayesh K. Gupta; Peter Morales; Ross Allen; Mykel J. Kochenderfer

arXiv:2006.11438·cs.LG·February 5, 2021·38 cites

TL;DR

This paper introduces Deep Implicit Coordination Graphs (DICG), a novel MARL architecture that infers dynamic coordination structures and uses graph neural networks to improve multi-agent cooperation in complex tasks.

Contribution

DICG is the first method to learn dynamic coordination graphs in MARL, enabling scalable reasoning about joint actions without domain-specific design.

Findings

01 DICG effectively solves relative overgeneralization in predator-prey tasks.

02 DICG outperforms baselines on the StarCraft II Multi-Agent Challenge (SMAC).

03 DICG improves coordination in traffic junction environments.

Abstract

Multi-agent reinforcement learning (MARL) requires coordination to efficiently solve certain tasks. Fully centralized control is often infeasible in such domains due to the size of joint action spaces. Coordination-graph-based formalization allows reasoning about the joint action based on the structure of interactions. However, coordination graphs often require domain expertise in their design. This paper introduces the deep implicit coordination graph (DICG) architecture for such scenarios. DICG consists of a module for inferring the dynamic coordination graph structure, which is then used by a graph-neural-network-based module to learn to implicitly reason about the joint actions or values. DICG allows learning the tradeoff between full centralization and decentralization via standard actor-critic methods to significantly improve coordination for domains with a large number of agents. We apply DICG to…
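
The two-module structure described in the abstract (a graph-inference module followed by a graph neural network module) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes, for illustration only, that the soft coordination graph comes from a bilinear attention score between agent embeddings and that a single tanh graph layer with a residual connection does the message passing. All names (`infer_soft_graph`, `graph_layer`, `w_enc`, `w_attn`, `w_gnn`) and all sizes are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def infer_soft_graph(embeddings, w_attn):
    """Assumed graph-inference step: bilinear attention between agents.

    embeddings: (n_agents, d) array, w_attn: (d, d) array.
    Returns an (n_agents, n_agents) row-stochastic soft adjacency matrix.
    """
    scores = embeddings @ w_attn @ embeddings.T
    return softmax(scores, axis=-1)


def graph_layer(adjacency, embeddings, w_layer):
    """Assumed message-passing step: adjacency-weighted mixing of embeddings."""
    return np.tanh(adjacency @ embeddings @ w_layer)


# Hypothetical sizes and random weights, chosen only to make the sketch run.
rng = np.random.default_rng(0)
n_agents, obs_dim, d = 4, 6, 8
obs = rng.normal(size=(n_agents, obs_dim))
w_enc = rng.normal(size=(obs_dim, d)) * 0.1
w_attn = rng.normal(size=(d, d)) * 0.1
w_gnn = rng.normal(size=(d, d)) * 0.1

h = np.tanh(obs @ w_enc)               # per-agent observation embeddings
adj = infer_soft_graph(h, w_attn)      # dynamic graph, recomputed from each observation
h_coord = h + graph_layer(adj, h, w_gnn)  # residual keeps each agent's own embedding
```

Because `adj` is recomputed from the current observations, the graph changes from step to step. In a full actor-critic setup, `h_coord` would feed a policy or value head, which this sketch omits.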

Code & Models

Repositories

sisl/DICG (PyTorch, official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Cancer-related gene regulation

Methods: Graph Neural Network