Guided Cooperation in Hierarchical Reinforcement Learning via   Model-based Rollout

Haoran Wang; Zeshen Tang; Leya Yang; Yaoru Sun; Fang Wang; Siyu Zhang,; Yeming Chen

arXiv:2309.13508·cs.LG·April 9, 2024

Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout

Haoran Wang, Zeshen Tang, Leya Yang, Yaoru Sun, Fang Wang, Siyu Zhang,, Yeming Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces GCMR, a goal-conditioned hierarchical reinforcement learning framework that enhances inter-level cooperation and stability through model-based rollout, gradient penalties, and guided planning, leading to improved policy performance.

Contribution

The paper proposes GCMR, a novel HRL framework that integrates model-based rollout, gradient constraints, and one-step planning to improve inter-level cooperation and exploration.

Findings

01

GCMR improves sample efficiency and policy stability.

02

GCMR outperforms state-of-the-art algorithms in experiments.

03

Incorporating GCMR yields more robust hierarchical policies.

Abstract

Goal-conditioned hierarchical reinforcement learning (HRL) presents a promising approach for enabling effective exploration in complex, long-horizon reinforcement learning (RL) tasks through temporal abstraction. Empirically, heightened inter-level communication and coordination can induce more stable and robust policy improvement in hierarchical systems. Yet, most existing goal-conditioned HRL algorithms have primarily focused on the subgoal discovery, neglecting inter-level cooperation. Here, we propose a goal-conditioned HRL framework named Guided Cooperation via Model-based Rollout (GCMR), aiming to bridge inter-layer information synchronization and cooperation by exploiting forward dynamics. Firstly, the GCMR mitigates the state-transition error within off-policy correction via model-based rollout, thereby enhancing sample efficiency. Secondly, to prevent disruption by the unseen…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haoranwang-tj/gcmr_aclg_official
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInsect and Arachnid Ecology and Behavior