TarMAC: Targeted Multi-Agent Communication

Abhishek Das; Th\'eophile Gervet; Joshua Romoff; Dhruv Batra; Devi; Parikh; Michael Rabbat; Joelle Pineau

arXiv:1810.11187·cs.LG·February 25, 2020·26 cites

TarMAC: Targeted Multi-Agent Communication

Abhishek Das, Th\'eophile Gervet, Joshua Romoff, Dhruv Batra, Devi, Parikh, Michael Rabbat, Joelle Pineau

PDF

Open Access

TL;DR

TarMAC introduces a targeted, multi-round communication framework for multi-agent reinforcement learning that improves cooperation, interpretability, and adaptability across various environments and task complexities.

Contribution

It presents a novel targeted communication architecture that learns who to communicate with and how, solely from task rewards, enhancing multi-agent cooperation.

Findings

01

Improved performance over state-of-the-art methods.

02

Communication strategies are interpretable and intuitive.

03

Effective in diverse environments and multi-agent settings.

Abstract

We propose a targeted communication architecture for multi-agent reinforcement learning, where agents learn both what messages to send and whom to address them to while performing cooperative tasks in partially-observable environments. This targeting behavior is learnt solely from downstream task-specific reward without any communication supervision. We additionally augment this with a multi-round communication approach where agents coordinate via multiple rounds of communication before taking actions in the environment. We evaluate our approach on a diverse set of cooperative multi-agent tasks, of varying difficulties, with varying number of agents, in a variety of environments ranging from 2D grid layouts of shapes and simulated traffic junctions to 3D indoor environments, and demonstrate the benefits of targeted and multi-round communication. Moreover, we show that the targeted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Adversarial Robustness in Machine Learning