Hierarchical Reinforcement Learning Based on Planning Operators

Jing Zhang; Emmanuel Dean; Karinne Ramirez-Amaro

arXiv:2309.14237·cs.RO·July 1, 2024

Hierarchical Reinforcement Learning Based on Planning Operators

Jing Zhang, Emmanuel Dean, Karinne Ramirez-Amaro

PDF

Open Access

TL;DR

This paper presents a hierarchical reinforcement learning framework that combines symbolic planning operators with low-level policies to improve long-horizon robotic manipulation tasks like stacking, achieving high success rates and reduced training time.

Contribution

The paper introduces a novel integration of planning operators into hierarchical RL using SAC-X, enabling flexible, reusable policies for complex manipulation tasks.

Findings

01

97.2% success rate in stacking tasks

02

Training time reduced by 68%

03

High success rates in individual actions like reach and lift

Abstract

Long-horizon manipulation tasks such as stacking represent a longstanding challenge in the field of robotic manipulation, particularly when using reinforcement learning (RL) methods which often struggle to learn the correct sequence of actions for achieving these complex goals. To learn this sequence, symbolic planning methods offer a good solution based on high-level reasoning, however, planners often fall short in addressing the low-level control specificity needed for precise execution. This paper introduces a novel framework that integrates symbolic planning with hierarchical RL through the cooperation of high-level operators and low-level policies. Our contribution integrates planning operators (e.g. preconditions and effects) as part of the hierarchical RL algorithm based on the Scheduled Auxiliary Control (SAC-X) method. We developed a dual-purpose high-level operator, which can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Viral Infectious Diseases and Gene Expression in Insects