Multi-Agent Deep Reinforcement Learning Under Constrained Communications
Shahil Shaik, Jonathon M. Smereka, and Yue Wang

TL;DR
This paper introduces a fully distributed multi-agent reinforcement learning framework that eliminates the need for centralized information, enabling scalable, robust, and adaptable cooperation among agents using local observations and multi-hop communication.
Contribution
The paper proposes a novel Distributed Graph Attention Network (D-GAT) and a distributed MARL framework called DG-MAPPO that operate without centralized critics or global state information.
Findings
Outperforms strong CTDE baselines on multiple multi-agent benchmarks
Achieves superior coordination with both homogeneous and heterogeneous teams
Provides a scalable and robust decentralized learning approach
Abstract
Centralized training with decentralized execution (CTDE) has been the dominant paradigm in multi-agent reinforcement learning (MARL), but its reliance on global state information during training introduces scalability, robustness, and generalization bottlenecks. Moreover, in practical scenarios such as adding/dropping teammates or facing environment dynamics that differ from the training, CTDE methods can be brittle and costly to retrain, whereas distributed approaches allow agents to adapt using only local information and peer-to-peer communication. We present a distributed MARL framework that removes the need for centralized critics or global information. Firstly, we develop a novel Distributed Graph Attention Network (D-GAT) that performs global state inference through multi-hop communication, where agents integrate neighbor features via input-dependent attention weights in a fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning
