TarMAC: Targeted Multi-Agent Communication
Abhishek Das, Th\'eophile Gervet, Joshua Romoff, Dhruv Batra, Devi, Parikh, Michael Rabbat, Joelle Pineau

TL;DR
TarMAC introduces a targeted, multi-round communication framework for multi-agent reinforcement learning that improves cooperation, interpretability, and adaptability across various environments and task complexities.
Contribution
It presents a novel targeted communication architecture that learns who to communicate with and how, solely from task rewards, enhancing multi-agent cooperation.
Findings
Improved performance over state-of-the-art methods.
Communication strategies are interpretable and intuitive.
Effective in diverse environments and multi-agent settings.
Abstract
We propose a targeted communication architecture for multi-agent reinforcement learning, where agents learn both what messages to send and whom to address them to while performing cooperative tasks in partially-observable environments. This targeting behavior is learnt solely from downstream task-specific reward without any communication supervision. We additionally augment this with a multi-round communication approach where agents coordinate via multiple rounds of communication before taking actions in the environment. We evaluate our approach on a diverse set of cooperative multi-agent tasks, of varying difficulties, with varying number of agents, in a variety of environments ranging from 2D grid layouts of shapes and simulated traffic junctions to 3D indoor environments, and demonstrate the benefits of targeted and multi-round communication. Moreover, we show that the targeted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Adversarial Robustness in Machine Learning
