Inferring Latent Temporal Sparse Coordination Graph for Multi-Agent Reinforcement Learning
Wei Duan, Jie Lu, Junyu Xuan

TL;DR
This paper introduces a novel Latent Temporal Sparse Coordination Graph (LTS-CG) for multi-agent reinforcement learning, utilizing historical data to improve agent cooperation and scalability.
Contribution
It proposes a new graph inference method that captures agent dependencies over time, reducing computational complexity and enhancing cooperation in MARL.
Findings
LTS-CG outperforms existing methods on StarCraft II benchmark.
The approach effectively captures temporal dependencies and agent relations.
Scalability is improved due to complexity depending only on number of agents.
Abstract
Effective agent coordination is crucial in cooperative Multi-Agent Reinforcement Learning (MARL). While agent cooperation can be represented by graph structures, prevailing graph learning methods in MARL are limited. They rely solely on one-step observations, neglecting crucial historical experiences, leading to deficient graphs that foster redundant or detrimental information exchanges. Additionally, high computational demands for action-pair calculations in dense graphs impede scalability. To address these challenges, we propose inferring a Latent Temporal Sparse Coordination Graph (LTS-CG) for MARL. The LTS-CG leverages agents' historical observations to calculate an agent-pair probability matrix, where a sparse graph is sampled from and used for knowledge exchange between agents, thereby simultaneously capturing agent dependencies and relation uncertainty. The computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
