Efficient Communication via Self-supervised Information Aggregation for   Online and Offline Multi-agent Reinforcement Learning

Cong Guan; Feng Chen; Lei Yuan; Zongzhang Zhang; Yang Yu

arXiv:2302.09605·cs.LG·February 21, 2023·1 cites

Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning

Cong Guan, Feng Chen, Lei Yuan, Zongzhang Zhang, Yang Yu

PDF

Open Access

TL;DR

This paper introduces MASIA, a self-supervised message aggregation method for multi-agent reinforcement learning that improves coordination by creating compact, relevant message representations, and also establishes offline benchmarks for communication research.

Contribution

The paper proposes a novel self-supervised message aggregation technique for cooperative MARL and creates the first offline benchmarks for multi-agent communication.

Findings

01

MASIA outperforms existing methods in online settings.

02

The offline benchmarks enable new research directions.

03

Empirical results validate the effectiveness of message aggregation.

Abstract

Utilizing messages from teammates can improve coordination in cooperative Multi-agent Reinforcement Learning (MARL). Previous works typically combine raw messages of teammates with local information as inputs for policy. However, neglecting message aggregation poses significant inefficiency for policy learning. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in cooperative MARL. In this paper, we propose Multi-Agent communication via Self-supervised Information Aggregation (MASIA), where agents can aggregate the received messages into compact representations with high relevance to augment the local policy. Specifically, we design a permutation invariant message encoder to generate common information-aggregated representation from messages and optimize it via reconstructing and shooting future…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing