Multi-agent Deep Reinforcement Learning with Extremely Noisy Observations
Ozsel Kilinc, Giovanni Montana

TL;DR
This paper introduces MADDPG-M, a multi-agent reinforcement learning algorithm that enables agents to learn effective communication strategies in environments with extremely noisy observations, improving collaboration and performance.
Contribution
The paper proposes a novel two-level learning mechanism for multi-agent deep reinforcement learning with noisy observations, incorporating learned communication policies without explicit feedback.
Findings
MADDPG-M outperforms baselines in six complex environments.
Agents learn when to share private observations effectively.
Communication policies improve overall system performance.
Abstract
Multi-agent reinforcement learning systems aim to provide interacting agents with the ability to collaboratively learn and adapt to the behaviour of other agents. In many real-world applications, the agents can only acquire a partial view of the world. Here we consider a setting whereby most agents' observations are also extremely noisy, hence only weakly correlated to the true state of the environment. Under these circumstances, learning an optimal policy becomes particularly challenging, even in the unrealistic case that an agent's policy can be made conditional upon all other agents' observations. To overcome these difficulties, we propose a multi-agent deep deterministic policy gradient algorithm enhanced by a communication medium (MADDPG-M), which implements a two-level, concurrent learning mechanism. An agent's policy depends on its own private observations as well as those…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Distributed Control Multi-Agent Systems
