Learning to Communicate to Solve Riddles with Deep Distributed Recurrent   Q-Networks

Jakob N. Foerster; Yannis M. Assael; Nando de Freitas; Shimon Whiteson

arXiv:1602.02672·cs.AI·February 9, 2016·86 cites

Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks

Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson

PDF

Open Access

TL;DR

This paper introduces DDRQN, a deep reinforcement learning approach enabling multi-agent teams to autonomously develop communication protocols to solve riddles, marking the first success in learning communication protocols with deep RL.

Contribution

The paper presents DDRQN, a novel deep distributed recurrent Q-network architecture that enables agents to learn communication protocols from scratch in multi-agent tasks.

Findings

01

DDRQN successfully solves communication-based riddles.

02

Agents develop effective, elegant communication protocols.

03

Each component of DDRQN is critical for its success.

Abstract

We propose deep distributed recurrent Q-networks (DDRQN), which enable teams of agents to learn to solve communication-based coordination tasks. In these tasks, the agents are not given any pre-designed communication protocol. Therefore, in order to successfully communicate, they must first automatically develop and agree upon their own communication protocol. We present empirical results on two multi-agent learning problems based on well-known riddles, demonstrating that DDRQN can successfully solve such tasks and discover elegant communication protocols to do so. To our knowledge, this is the first time deep reinforcement learning has succeeded in learning communication protocols. In addition, we present ablation experiments that confirm that each of the main components of the DDRQN architecture are critical to its success.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Modular Robots and Swarm Intelligence · Distributed Control Multi-Agent Systems