Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus

Zijian Zhao; Jing Gao; Sen Li

arXiv:2604.13472·cs.LG·April 16, 2026

Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus

Zijian Zhao, Jing Gao, Sen Li

PDF

1 Repo

TL;DR

This paper introduces CMAT, a novel multi-agent transformer framework that achieves order-independent decision-making and superior coordination in cooperative MARL tasks by unifying agents into a hierarchical single-agent model.

Contribution

The paper presents a hierarchical Transformer-based framework that bridges MARL to SARL, enabling order-independent joint decision-making and improved coordination.

Findings

01

CMAT outperforms recent centralized solutions in benchmark tasks.

02

The hierarchical decision mechanism enables order-independent joint actions.

03

CMAT maintains expressive coordination while using single-agent PPO for optimization.

Abstract

Cooperative multi-agent reinforcement learning (MARL) is widely used to address large joint observation and action spaces by decomposing a centralized control problem into multiple interacting agents. However, such decomposition often introduces additional challenges, including non-stationarity, unstable training, weak coordination, and limited theoretical guarantees. In this paper, we propose the Consensus Multi-Agent Transformer (CMAT), a centralized framework that bridges cooperative MARL to a hierarchical single-agent reinforcement learning (SARL) formulation. CMAT treats all agents as a unified entity and employs a Transformer encoder to process the large joint observation space. To handle the extensive joint action space, we introduce a hierarchical decision-making mechanism in which a Transformer decoder autoregressively generates a high-level consensus vector, simulating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RS2002/CMAT
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.