Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning
Xuechen Mu, Hankz Hankui Zhuo, Chen Chen, Kai Zhang, Chao Yu and, Jianye Hao

TL;DR
This paper introduces SOMARL, a hierarchical framework combining HTN planning and MARL to improve cooperative multi-agent learning in sparse reward environments by leveraging symbolic knowledge and domain guidance.
Contribution
The paper presents a novel hierarchical framework integrating HTN planning with MARL, utilizing symbolic knowledge to enhance exploration and performance in cooperative multi-agent tasks.
Findings
SOMARL outperforms state-of-the-art MARL methods on benchmarks.
Hierarchical structure effectively guides exploration in sparse reward settings.
Symbolic knowledge integration improves learning efficiency and success rate.
Abstract
Exploring sparse reward multi-agent reinforcement learning (MARL) environments with traps in a collaborative manner is a complex task. Agents typically fail to reach the goal state and fall into traps, which affects the overall performance of the system. To overcome this issue, we present SOMARL, a framework that uses prior knowledge to reduce the exploration space and assist learning. In SOMARL, agents are treated as part of the MARL environment, and symbolic knowledge is embedded using a tree structure to build a knowledge hierarchy. The framework has a two-layer hierarchical structure, comprising a hybrid module with a Hierarchical Task Network (HTN) planning and meta-controller at the higher level, and a MARL-based interactive module at the lower level. The HTN module and meta-controller use Hierarchical Domain Definition Language (HDDL) and the option framework to formalize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications
Methodsfail
