Learning Multi-agent Action Coordination via Electing First-move Agent
Jingqing Ruan, Linghui Meng, Xuantang Xiong, Dengpeng Xing, and Bo Xu

TL;DR
This paper introduces a novel bi-level decision hierarchy with an election mechanism using graph convolutional networks to enable asynchronous multi-agent action coordination, improving performance in cooperative tasks.
Contribution
It presents the first explicit model for asynchronous multi-agent coordination, including a new election mechanism and a dynamically weighted mixing network for better value estimation.
Findings
Achieves superior performance in Cooperative Navigation
Outperforms existing methods in Google Football
Effectively models asynchronous agent interactions
Abstract
Learning to coordinate actions among agents is essential in complicated multi-agent systems. Prior works are constrained mainly by the assumption that all agents act simultaneously, and asynchronous action coordination between agents is rarely considered. This paper introduces a bi-level multi-agent decision hierarchy for coordinated behavior planning. We propose a novel election mechanism in which we adopt a graph convolutional network to model the interaction among agents and elect a first-move agent for asynchronous guidance. We also propose a dynamically weighted mixing network to effectively reduce the misestimation of the value function during training. This work is the first to explicitly model the asynchronous multi-agent action coordination, and this explicitness enables to choose the optimal first-move agent. The results on Cooperative Navigation and Google Football…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics
