R-MADDPG for Partially Observable Environments and Limited Communication

Rose E. Wang; Michael Everett; Jonathan P. How

arXiv:2002.06684·cs.MA·February 19, 2020·64 cites

R-MADDPG for Partially Observable Environments and Limited Communication

Rose E. Wang, Michael Everett, Jonathan P. How

PDF

Open Access 1 Repo

TL;DR

This paper introduces R-MADDPG, a deep recurrent multiagent reinforcement learning framework designed to improve coordination in partially observable environments with limited communication, demonstrating effective learning of communication patterns and resource sharing.

Contribution

The paper presents a novel R-MADDPG framework that incorporates recurrence to handle partial observability and limited communication in multiagent systems, advancing coordination capabilities.

Findings

01

Learns time dependencies for sharing observations

02

Develops different communication patterns among agents

03

Handles resource limitations effectively

Abstract

There are several real-world tasks that would benefit from applying multiagent reinforcement learning (MARL) algorithms, including the coordination among self-driving cars. The real world has challenging conditions for multiagent learning systems, such as its partial observable and nonstationary nature. Moreover, if agents must share a limited resource (e.g. network bandwidth) they must all learn how to coordinate resource use. This paper introduces a deep recurrent multiagent actor-critic framework (R-MADDPG) for handling multiagent coordination under partial observable set-tings and limited communication. We investigate recurrency effects on performance and communication use of a team of agents. We demonstrate that the resulting framework learns time dependencies for sharing missing observations, handling resource limitations, and developing different communication patterns among…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rosewang2008/rmaddpg
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Evolutionary Algorithms and Applications