Minimizing Communication while Maximizing Performance in Multi-Agent   Reinforcement Learning

Varun Kumar Vijay; Hassam Sheikh; Somdeb Majumdar; Mariano; Phielipp

arXiv:2106.08482·cs.AI·December 9, 2021

Minimizing Communication while Maximizing Performance in Multi-Agent Reinforcement Learning

Varun Kumar Vijay, Hassam Sheikh, Somdeb Majumdar, Mariano, Phielipp

PDF

Open Access

TL;DR

This paper proposes a method to reduce communication in multi-agent reinforcement learning by optimizing a combined objective with a penalty, achieving up to 75% less communication without performance loss.

Contribution

It introduces a novel training approach with techniques like partial training and message forwarding to minimize communication while maintaining performance in multi-agent tasks.

Findings

01

Reduced communication by 75% without performance loss.

02

Training with a communication penalty on 50% of episodes stabilizes learning.

03

Message forwarding improves information retention and performance.

Abstract

Inter-agent communication can significantly increase performance in multi-agent tasks that require co-ordination to achieve a shared goal. Prior work has shown that it is possible to learn inter-agent communication protocols using multi-agent reinforcement learning and message-passing network architectures. However, these models use an unconstrained broadcast communication model, in which an agent communicates with all other agents at every step, even when the task does not require it. In real-world applications, where communication may be limited by system constraints like bandwidth, power and network capacity, one might need to reduce the number of messages that are sent. In this work, we explore a simple method of minimizing communication while maximizing performance in multi-task learning: simultaneously optimizing a task-specific objective and a communication penalty. We show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices