Distributed Cooperative Multi-Agent Reinforcement Learning with Directed   Coordination Graph

Gangshan Jing; He Bai; Jemin George; Aranya Chakrabortty; Piyush.; K. Sharma

arXiv:2201.04962·cs.MA·January 14, 2022

Distributed Cooperative Multi-Agent Reinforcement Learning with Directed Coordination Graph

Gangshan Jing, He Bai, Jemin George, Aranya Chakrabortty, Piyush., K. Sharma

PDF

Open Access

TL;DR

This paper introduces a scalable distributed multi-agent reinforcement learning algorithm that operates on directed graphs, avoiding costly consensus procedures and improving efficiency in cooperative multi-agent systems.

Contribution

It proposes a novel distributed RL method using local value functions and directed communication, eliminating the need for consensus algorithms and enhancing scalability.

Findings

01

The algorithm achieves high scalability compared to existing ZOO-based RL methods.

02

It effectively handles resource allocation problems in multi-agent systems.

03

The approach reduces communication costs by leveraging local information and directed graphs.

Abstract

Existing distributed cooperative multi-agent reinforcement learning (MARL) frameworks usually assume undirected coordination graphs and communication graphs while estimating a global reward via consensus algorithms for policy evaluation. Such a framework may induce expensive communication costs and exhibit poor scalability due to requirement of global consensus. In this work, we study MARLs with directed coordination graphs, and propose a distributed RL algorithm where the local policy evaluations are based on local value functions. The local value function of each agent is obtained by local communication with its neighbors through a directed learning-induced communication graph, without using any consensus algorithm. A zeroth-order optimization (ZOO) approach based on parameter perturbation is employed to achieve gradient estimation. By comparing with existing ZOO-based RL algorithms,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Adaptive Dynamic Programming Control · Reinforcement Learning in Robotics