Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph
Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush., K. Sharma

TL;DR
This paper introduces a scalable distributed multi-agent reinforcement learning algorithm that operates on directed graphs, avoiding costly consensus procedures and improving efficiency in cooperative multi-agent systems.
Contribution
It proposes a novel distributed RL method using local value functions and directed communication, eliminating the need for consensus algorithms and enhancing scalability.
Findings
The algorithm achieves high scalability compared to existing ZOO-based RL methods.
It effectively handles resource allocation problems in multi-agent systems.
The approach reduces communication costs by leveraging local information and directed graphs.
Abstract
Existing distributed cooperative multi-agent reinforcement learning (MARL) frameworks usually assume undirected coordination graphs and communication graphs while estimating a global reward via consensus algorithms for policy evaluation. Such a framework may induce expensive communication costs and exhibit poor scalability due to requirement of global consensus. In this work, we study MARLs with directed coordination graphs, and propose a distributed RL algorithm where the local policy evaluations are based on local value functions. The local value function of each agent is obtained by local communication with its neighbors through a directed learning-induced communication graph, without using any consensus algorithm. A zeroth-order optimization (ZOO) approach based on parameter perturbation is employed to achieve gradient estimation. By comparing with existing ZOO-based RL algorithms,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Adaptive Dynamic Programming Control · Reinforcement Learning in Robotics
