Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning

Seungho Baek; Taegeon Park; Jongchan Park; Seungjun Oh; Yusung Kim

arXiv:2506.07744·cs.LG·July 8, 2025

Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning

Seungho Baek, Taegeon Park, Jongchan Park, Seungjun Oh, Yusung Kim

PDF

Open Access 1 Repo 1 Models

TL;DR

The paper introduces Graph-Assisted Stitching (GAS), a novel offline hierarchical reinforcement learning framework that uses graph search and state clustering to improve task efficiency and performance across various domains.

Contribution

GAS formulates subgoal selection as a graph search problem using state clustering in TDR space, bypassing high-level policy learning limitations.

Findings

01

GAS significantly outperforms prior methods in locomotion, navigation, and manipulation tasks.

02

In the most critical task, GAS achieves a score of 88.3, far exceeding the previous 1.0.

03

The Temporal Efficiency metric improves graph quality and task success.

Abstract

Existing offline hierarchical reinforcement learning methods rely on high-level policy learning to generate subgoal sequences. However, their efficiency degrades as task horizons increase, and they lack effective strategies for stitching useful state transitions across different trajectories. We propose Graph-Assisted Stitching (GAS), a novel framework that formulates subgoal selection as a graph search problem rather than learning an explicit high-level policy. By embedding states into a Temporal Distance Representation (TDR) space, GAS clusters semantically similar states from different trajectories into unified graph nodes, enabling efficient transition stitching. A shortest-path algorithm is then applied to select subgoal sequences within the graph, while a low-level policy learns to reach the subgoals. To improve graph quality, we introduce the Temporal Efficiency (TE) metric,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qortmdgh4141/gas
noneOfficial

Models

🤗
qortmdgh4141/GAS
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Human Pose and Action Recognition