Hierarchical Task Network Planning for Facilitating Cooperative   Multi-Agent Reinforcement Learning

Xuechen Mu; Hankz Hankui Zhuo; Chen Chen; Kai Zhang; Chao Yu and; Jianye Hao

arXiv:2306.08359·cs.AI·June 16, 2023·1 cites

Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning

Xuechen Mu, Hankz Hankui Zhuo, Chen Chen, Kai Zhang, Chao Yu and, Jianye Hao

PDF

Open Access

TL;DR

This paper introduces SOMARL, a hierarchical framework combining HTN planning and MARL to improve cooperative multi-agent learning in sparse reward environments by leveraging symbolic knowledge and domain guidance.

Contribution

The paper presents a novel hierarchical framework integrating HTN planning with MARL, utilizing symbolic knowledge to enhance exploration and performance in cooperative multi-agent tasks.

Findings

01

SOMARL outperforms state-of-the-art MARL methods on benchmarks.

02

Hierarchical structure effectively guides exploration in sparse reward settings.

03

Symbolic knowledge integration improves learning efficiency and success rate.

Abstract

Exploring sparse reward multi-agent reinforcement learning (MARL) environments with traps in a collaborative manner is a complex task. Agents typically fail to reach the goal state and fall into traps, which affects the overall performance of the system. To overcome this issue, we present SOMARL, a framework that uses prior knowledge to reduce the exploration space and assist learning. In SOMARL, agents are treated as part of the MARL environment, and symbolic knowledge is embedded using a tree structure to build a knowledge hierarchy. The framework has a two-layer hierarchical structure, comprising a hybrid module with a Hierarchical Task Network (HTN) planning and meta-controller at the higher level, and a MARL-based interactive module at the lower level. The HTN module and meta-controller use Hierarchical Domain Definition Language (HDDL) and the option framework to formalize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications

Methodsfail