GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement   Learning

Jingqing Ruan; Yali Du; Xuantang Xiong; Dengpeng Xing; Xiyun Li,; Linghui Meng; Haifeng Zhang; Jun Wang; Bo Xu

arXiv:2201.06257·cs.MA·January 19, 2022·25 cites

GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning

Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li,, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel graph-based coordination strategy for multi-agent reinforcement learning that factorizes joint policies into a graph generator and a coordinated policy, improving coordination in complex environments.

Contribution

It proposes a new method that uses directed acyclic graphs to model dynamic decision structures among agents, enhancing coordination capabilities.

Findings

01

Outperforms existing methods in collaborative tasks

02

Effectively balances efficiency and performance through DAG constraints

03

Demonstrates superior results in multiple multi-agent environments

Abstract

Many real-world scenarios involve a team of agents that have to coordinate their policies to achieve a shared goal. Previous studies mainly focus on decentralized control to maximize a common reward and barely consider the coordination among control policies, which is critical in dynamic and complicated environments. In this work, we propose factorizing the joint team policy into a graph generator and graph-based coordinated policy to enable coordinated behaviours among agents. The graph generator adopts an encoder-decoder framework that outputs directed acyclic graphs (DAGs) to capture the underlying dynamic decision structure. We also apply the DAGness-constrained and DAG depth-constrained optimization in the graph generator to balance efficiency and performance. The graph-based coordinated policy exploits the generated decision structure. The graph generator and coordinated policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amanda-1997/gcs_aamas337
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvacuation and Crowd Dynamics · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics