Scalable Communication for Multi-Agent Reinforcement Learning via   Transformer-Based Email Mechanism

Xudong Guo; Daming Shi; Wenhui Fan

arXiv:2301.01919·cs.MA·June 13, 2023·1 cites

Scalable Communication for Multi-Agent Reinforcement Learning via Transformer-Based Email Mechanism

Xudong Guo, Daming Shi, Wenhui Fan

PDF

Open Access

TL;DR

This paper introduces a scalable Transformer-based email mechanism for multi-agent reinforcement learning that enables efficient, targeted communication among agents, improving cooperation in partially-observed environments without increasing complexity as agent numbers grow.

Contribution

The paper proposes a novel Transformer-based email mechanism that enables scalable, targeted communication in multi-agent reinforcement learning, inspired by human email forwarding.

Findings

01

TEM outperforms baselines on multiple benchmarks.

02

TEM maintains performance with varying agent numbers.

03

TEM does not require retraining when agent count changes.

Abstract

Communication can impressively improve cooperation in multi-agent reinforcement learning (MARL), especially for partially-observed tasks. However, existing works either broadcast the messages leading to information redundancy, or learn targeted communication by modeling all the other agents as targets, which is not scalable when the number of agents varies. In this work, to tackle the scalability problem of MARL communication for partially-observed tasks, we propose a novel framework Transformer-based Email Mechanism (TEM). The agents adopt local communication to send messages only to the ones that can be observed without modeling all the agents. Inspired by human cooperation with email forwarding, we design message chains to forward information to cooperate with the agents outside the observation range. We introduce Transformer to encode and decode the message chain to choose the next…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOpinion Dynamics and Social Influence · Complex Network Analysis Techniques · Impact of Technology on Adolescents

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Softmax · Adam · Byte Pair Encoding · Residual Connection · Label Smoothing · Dropout · Dense Connections