Learning Practical Communication Strategies in Cooperative Multi-Agent   Reinforcement Learning

Diyi Hu; Chi Zhang; Viktor Prasanna; Bhaskar Krishnamachari

arXiv:2209.01288·cs.AI·September 16, 2022·1 cites

Learning Practical Communication Strategies in Cooperative Multi-Agent Reinforcement Learning

Diyi Hu, Chi Zhang, Viktor Prasanna, Bhaskar Krishnamachari

PDF

Open Access

TL;DR

This paper introduces a framework for multi-agent reinforcement learning that optimizes communication strategies considering wireless network unreliability, improving cooperation, efficiency, and convergence in realistic settings.

Contribution

It presents a novel approach to learn when, what, and how agents communicate in wireless environments, incorporating network conditions and a new neural message encoder.

Findings

01

Significant performance improvements over state-of-the-art methods.

02

Faster convergence and higher communication efficiency.

03

Robust cooperation in realistic wireless network simulations.

Abstract

In Multi-Agent Reinforcement Learning, communication is critical to encourage cooperation among agents. Communication in realistic wireless networks can be highly unreliable due to network conditions varying with agents' mobility, and stochasticity in the transmission process. We propose a framework to learn practical communication strategies by addressing three fundamental questions: (1) When: Agents learn the timing of communication based on not only message importance but also wireless channel conditions. (2) What: Agents augment message contents with wireless network measurements to better select the game and communication actions. (3) How: Agents use a novel neural message encoder to preserve all information from received messages, regardless of the number and order of messages. Simulating standard benchmarks under realistic wireless network settings, we show significant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOpinion Dynamics and Social Influence

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings