Multi-Agent Reinforcement Learning with Selective State-Space Models

Jemma Daniel; Ruan de Kock; Louay Ben Nessir; Sasha Abramowitz; Omayma; Mahjoub; Wiem Khlifi; Claude Formanek; Arnu Pretorius

arXiv:2410.19382·cs.LG·October 29, 2024

Multi-Agent Reinforcement Learning with Selective State-Space Models

Jemma Daniel, Ruan de Kock, Louay Ben Nessir, Sasha Abramowitz, Omayma, Mahjoub, Wiem Khlifi, Claude Formanek, Arnu Pretorius

PDF

Open Access

TL;DR

This paper introduces Multi-Agent Mamba, a scalable and efficient alternative to Transformer-based models in Multi-Agent Reinforcement Learning, matching performance while significantly improving computational efficiency and scalability to larger agent populations.

Contribution

The paper demonstrates that State-Space Models, specifically Mamba, can replace Transformers in MARL, maintaining performance and enhancing scalability.

Findings

01

MAM matches MAT's performance in multiple environments

02

MAM offers superior scalability to larger agent groups

03

SSMs can effectively replace Transformers in MARL

Abstract

The Transformer model has demonstrated success across a wide range of domains, including in Multi-Agent Reinforcement Learning (MARL) where the Multi-Agent Transformer (MAT) has emerged as a leading algorithm in the field. However, a significant drawback of Transformer models is their quadratic computational complexity relative to input size, making them computationally expensive when scaling to larger inputs. This limitation restricts MAT's scalability in environments with many agents. Recently, State-Space Models (SSMs) have gained attention due to their computational efficiency, but their application in MARL remains unexplored. In this work, we investigate the use of Mamba, a recent SSM, in MARL and assess whether it can match the performance of MAT while providing significant improvements in efficiency. We introduce a modified version of MAT that incorporates standard and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsAttention Is All You Need · Dense Connections · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Linear Layer · Multi-Head Attention · Softmax · Adam