TL;DR
This paper introduces CMAT, a novel multi-agent transformer framework that achieves order-independent decision-making and superior coordination in cooperative MARL tasks by unifying agents into a hierarchical single-agent model.
Contribution
The paper presents a hierarchical Transformer-based framework that bridges MARL to SARL, enabling order-independent joint decision-making and improved coordination.
Findings
CMAT outperforms recent centralized solutions in benchmark tasks.
The hierarchical decision mechanism enables order-independent joint actions.
CMAT maintains expressive coordination while using single-agent PPO for optimization.
Abstract
Cooperative multi-agent reinforcement learning (MARL) is widely used to address large joint observation and action spaces by decomposing a centralized control problem into multiple interacting agents. However, such decomposition often introduces additional challenges, including non-stationarity, unstable training, weak coordination, and limited theoretical guarantees. In this paper, we propose the Consensus Multi-Agent Transformer (CMAT), a centralized framework that bridges cooperative MARL to a hierarchical single-agent reinforcement learning (SARL) formulation. CMAT treats all agents as a unified entity and employs a Transformer encoder to process the large joint observation space. To handle the extensive joint action space, we introduce a hierarchical decision-making mechanism in which a Transformer decoder autoregressively generates a high-level consensus vector, simulating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
