Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
Haoran Wang, Zeshen Tang, Leya Yang, Yaoru Sun, Fang Wang, Siyu Zhang,, Yeming Chen

TL;DR
This paper introduces GCMR, a goal-conditioned hierarchical reinforcement learning framework that enhances inter-level cooperation and stability through model-based rollout, gradient penalties, and guided planning, leading to improved policy performance.
Contribution
The paper proposes GCMR, a novel HRL framework that integrates model-based rollout, gradient constraints, and one-step planning to improve inter-level cooperation and exploration.
Findings
GCMR improves sample efficiency and policy stability.
GCMR outperforms state-of-the-art algorithms in experiments.
Incorporating GCMR yields more robust hierarchical policies.
Abstract
Goal-conditioned hierarchical reinforcement learning (HRL) presents a promising approach for enabling effective exploration in complex, long-horizon reinforcement learning (RL) tasks through temporal abstraction. Empirically, heightened inter-level communication and coordination can induce more stable and robust policy improvement in hierarchical systems. Yet, most existing goal-conditioned HRL algorithms have primarily focused on the subgoal discovery, neglecting inter-level cooperation. Here, we propose a goal-conditioned HRL framework named Guided Cooperation via Model-based Rollout (GCMR), aiming to bridge inter-layer information synchronization and cooperation by exploiting forward dynamics. Firstly, the GCMR mitigates the state-transition error within off-policy correction via model-based rollout, thereby enhancing sample efficiency. Secondly, to prevent disruption by the unseen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInsect and Arachnid Ecology and Behavior
