ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with   Deep Multi-agent Reinforcement Learning

Hangyu Mao; Zhibo Gong; Yan Ni; Zhen Xiao

arXiv:1706.03235·cs.AI·October 31, 2017·40 cites

ACCNet: Actor-Coordinator-Critic Net for "Learning-to-Communicate" with Deep Multi-agent Reinforcement Learning

Hangyu Mao, Zhibo Gong, Yan Ni, Zhen Xiao

PDF

Open Access

TL;DR

This paper introduces ACCNet, a deep reinforcement learning framework that enables multi-agent systems to learn effective communication protocols from scratch in partially observable environments, outperforming existing baselines.

Contribution

The paper presents ACCNet, a novel actor-coordinator-critic architecture that integrates deep learning with reinforcement learning to learn communication protocols without predefined schemes.

Findings

01

ACCNet outperforms baseline methods in various environments.

02

It effectively learns communication protocols from scratch.

03

The learned protocols are interpretable and adaptable.

Abstract

Communication is a critical factor for the big multi-agent world to stay organized and productive. Typically, most previous multi-agent "learning-to-communicate" studies try to predefine the communication protocols or use technologies such as tabular reinforcement learning and evolutionary algorithm, which can not generalize to changing environment or large collection of agents. In this paper, we propose an Actor-Coordinator-Critic Net (ACCNet) framework for solving "learning-to-communicate" problem. The ACCNet naturally combines the powerful actor-critic reinforcement learning technology with deep learning technology. It can efficiently learn the communication protocols even from scratch under partially observable environment. We demonstrate that the ACCNet can achieve better results than several baselines under both continuous and discrete action space environments. We also analyse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Reservoir Computing · Distributed Control Multi-Agent Systems