Multi-Agent Deep Reinforcement Learning Under Constrained Communications

Shahil Shaik; Jonathon M. Smereka; and Yue Wang

arXiv:2601.17069·cs.LG·January 27, 2026

Multi-Agent Deep Reinforcement Learning Under Constrained Communications

Shahil Shaik, Jonathon M. Smereka, and Yue Wang

PDF

Open Access

TL;DR

This paper introduces a fully distributed multi-agent reinforcement learning framework that eliminates the need for centralized information, enabling scalable, robust, and adaptable cooperation among agents using local observations and multi-hop communication.

Contribution

The paper proposes a novel Distributed Graph Attention Network (D-GAT) and a distributed MARL framework called DG-MAPPO that operate without centralized critics or global state information.

Findings

01

Outperforms strong CTDE baselines on multiple multi-agent benchmarks

02

Achieves superior coordination with both homogeneous and heterogeneous teams

03

Provides a scalable and robust decentralized learning approach

Abstract

Centralized training with decentralized execution (CTDE) has been the dominant paradigm in multi-agent reinforcement learning (MARL), but its reliance on global state information during training introduces scalability, robustness, and generalization bottlenecks. Moreover, in practical scenarios such as adding/dropping teammates or facing environment dynamics that differ from the training, CTDE methods can be brittle and costly to retrain, whereas distributed approaches allow agents to adapt using only local information and peer-to-peer communication. We present a distributed MARL framework that removes the need for centralized critics or global information. Firstly, we develop a novel Distributed Graph Attention Network (D-GAT) that performs global state inference through multi-hop communication, where agents integrate neighbor features via input-dependent attention weights in a fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning