Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution
Yunpeng Bai, Chen Gong, Bin Zhang, Guoliang Fan, Xinwen Hou, Yu Liu

TL;DR
This paper introduces HGCN-MIX, a hypergraph convolution-based method for multi-agent reinforcement learning that improves coordination among agents, especially in large-scale scenarios, by modeling their relationships explicitly.
Contribution
It proposes a novel hypergraph convolution approach integrated with value decomposition to enhance agent coordination in multi-agent systems.
Findings
HGCN-MIX outperforms current state-of-the-art methods in StarCraft II benchmarks.
The method shows greater improvements with more agents involved.
Learning more relationships in the hypergraph leads to stronger joint policies.
Abstract
Recent years have witnessed the great success of multi-agent systems (MAS). Value decomposition, which decomposes joint action values into individual action values, has been an important work in MAS. However, many value decomposition methods ignore the coordination among different agents, leading to the notorious "lazy agents" problem. To enhance the coordination in MAS, this paper proposes HyperGraph CoNvolution MIX (HGCN-MIX), a method that incorporates hypergraph convolution with value decomposition. HGCN-MIX models agents as well as their relationships as a hypergraph, where agents are nodes and hyperedges among nodes indicate that the corresponding agents can coordinate to achieve larger rewards. Then, it trains a hypergraph that can capture the collaborative relationships among agents. Leveraging the learned hypergraph to consider how other agents' observations and actions affect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Reinforcement Learning in Robotics · Online Learning and Analytics
MethodsMixing Adam and SGD · Self-Learning · Convolution
