GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning
Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li,, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu

TL;DR
This paper introduces a novel graph-based coordination strategy for multi-agent reinforcement learning that factorizes joint policies into a graph generator and a coordinated policy, improving coordination in complex environments.
Contribution
It proposes a new method that uses directed acyclic graphs to model dynamic decision structures among agents, enhancing coordination capabilities.
Findings
Outperforms existing methods in collaborative tasks
Effectively balances efficiency and performance through DAG constraints
Demonstrates superior results in multiple multi-agent environments
Abstract
Many real-world scenarios involve a team of agents that have to coordinate their policies to achieve a shared goal. Previous studies mainly focus on decentralized control to maximize a common reward and barely consider the coordination among control policies, which is critical in dynamic and complicated environments. In this work, we propose factorizing the joint team policy into a graph generator and graph-based coordinated policy to enable coordinated behaviours among agents. The graph generator adopts an encoder-decoder framework that outputs directed acyclic graphs (DAGs) to capture the underlying dynamic decision structure. We also apply the DAGness-constrained and DAG depth-constrained optimization in the graph generator to balance efficiency and performance. The graph-based coordinated policy exploits the generated decision structure. The graph generator and coordinated policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvacuation and Crowd Dynamics · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics
