Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning
Cong Guan, Feng Chen, Lei Yuan, Zongzhang Zhang, Yang Yu

TL;DR
This paper introduces MASIA, a self-supervised message aggregation method for multi-agent reinforcement learning that improves coordination by creating compact, relevant message representations, and also establishes offline benchmarks for communication research.
Contribution
The paper proposes a novel self-supervised message aggregation technique for cooperative MARL and creates the first offline benchmarks for multi-agent communication.
Findings
MASIA outperforms existing methods in online settings.
The offline benchmarks enable new research directions.
Empirical results validate the effectiveness of message aggregation.
Abstract
Utilizing messages from teammates can improve coordination in cooperative Multi-agent Reinforcement Learning (MARL). Previous works typically combine raw messages of teammates with local information as inputs for policy. However, neglecting message aggregation poses significant inefficiency for policy learning. Motivated by recent advances in representation learning, we argue that efficient message aggregation is essential for good coordination in cooperative MARL. In this paper, we propose Multi-Agent communication via Self-supervised Information Aggregation (MASIA), where agents can aggregate the received messages into compact representations with high relevance to augment the local policy. Specifically, we design a permutation invariant message encoder to generate common information-aggregated representation from messages and optimize it via reconstructing and shooting future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing
