Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen, Ke Wang, Guanghua Song, Xiaohong Jiang

TL;DR
This paper introduces a novel multi-agent reinforcement learning method that uses hierarchical graph attention networks and entropy optimization to learn stable continuous policies for large-scale multi-agent systems, improving exploration and performance.
Contribution
It presents a new MARL approach combining hierarchical graph attention and entropy-based policy optimization for continuous action spaces, enhancing scalability and transferability.
Findings
Outperforms baseline methods in large-scale cooperative tasks
Enables stable learning of continuous policies in multi-agent systems
Improves exploration and transferability of policies
Abstract
In most existing studies on large-scale multi-agent coordination, the control methods aim to learn discrete policies for agents with finite choices. They rarely consider selecting actions directly from continuous action spaces to provide more accurate control, which makes them unsuitable for more complex tasks. To solve the control issue due to large-scale multi-agent systems with continuous action spaces, we propose a novel MARL coordination control method that derives stable continuous policies. By optimizing policies with maximum entropy learning, agents improve their exploration in execution and acquire an excellent performance after training. We also employ hierarchical graph attention networks (HGAT) and gated recurrent units (GRU) to improve the scalability and transferability of our method. The experiments show that our method consistently outperforms all baselines in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
