Multi-agent Deep Reinforcement Learning with Extremely Noisy   Observations

Ozsel Kilinc; Giovanni Montana

arXiv:1812.00922·cs.LG·December 4, 2018·21 cites

Multi-agent Deep Reinforcement Learning with Extremely Noisy Observations

Ozsel Kilinc, Giovanni Montana

PDF

Open Access

TL;DR

This paper introduces MADDPG-M, a multi-agent reinforcement learning algorithm that enables agents to learn effective communication strategies in environments with extremely noisy observations, improving collaboration and performance.

Contribution

The paper proposes a novel two-level learning mechanism for multi-agent deep reinforcement learning with noisy observations, incorporating learned communication policies without explicit feedback.

Findings

01

MADDPG-M outperforms baselines in six complex environments.

02

Agents learn when to share private observations effectively.

03

Communication policies improve overall system performance.

Abstract

Multi-agent reinforcement learning systems aim to provide interacting agents with the ability to collaboratively learn and adapt to the behaviour of other agents. In many real-world applications, the agents can only acquire a partial view of the world. Here we consider a setting whereby most agents' observations are also extremely noisy, hence only weakly correlated to the true state of the environment. Under these circumstances, learning an optimal policy becomes particularly challenging, even in the unrealistic case that an agent's policy can be made conditional upon all other agents' observations. To overcome these difficulties, we propose a multi-agent deep deterministic policy gradient algorithm enhanced by a communication medium (MADDPG-M), which implements a two-level, concurrent learning mechanism. An agent's policy depends on its own private observations as well as those…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Distributed Control Multi-Agent Systems