Learning Transferable Cooperative Behavior in Multi-Agent Teams

Akshat Agarwal; Sumit Kumar; Katia Sycara

arXiv:1906.01202·cs.LG·June 5, 2019·46 cites

Learning Transferable Cooperative Behavior in Multi-Agent Teams

Akshat Agarwal, Sumit Kumar, Katia Sycara

PDF

Open Access 4 Repos

TL;DR

This paper introduces a graph-based multi-agent reinforcement learning framework that enables agents to learn cooperative behaviors, generalize to different team sizes, and operate effectively in decentralized, real-world scenarios.

Contribution

The paper presents a novel shared agent-entity graph approach for multi-agent cooperation that is invariant to team size and permutation, with state-of-the-art results and strong zero-shot transfer capabilities.

Findings

01

Achieved state-of-the-art results on coverage, formation, and line control tasks.

02

Demonstrated rapid transfer to different team sizes with zero-shot generalization.

03

Operates effectively in fully decentralized settings.

Abstract

While multi-agent interactions can be naturally modeled as a graph, the environment has traditionally been considered as a black box. We propose to create a shared agent-entity graph, where agents and environmental entities form vertices, and edges exist between the vertices which can communicate with each other. Agents learn to cooperate by exchanging messages along the edges of this graph. Our proposed multi-agent reinforcement learning framework is invariant to the number of agents or entities present in the system as well as permutation invariance, both of which are desirable properties for any multi-agent system representation. We present state-of-the-art results on coverage, formation and line control tasks for multi-agent teams in a fully decentralized framework and further show that the learned policies quickly transfer to scenarios with different team sizes along with strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Domain Adaptation and Few-Shot Learning