Goal-Reaching Policy Learning from Non-Expert Observations via Effective   Subgoal Guidance

RenMing Huang; Shaochong Liu; Yunqiang Pei; Peng Wang; Guoqing Wang,; Yang Yang; Hengtao Shen

arXiv:2409.03996·cs.LG·September 9, 2024

Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance

RenMing Huang, Shaochong Liu, Yunqiang Pei, Peng Wang, Guoqing Wang,, Yang Yang, Hengtao Shen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel subgoal guidance strategy for goal-reaching policy learning from non-expert, action-free observations, improving exploration efficiency and performance in complex robotic tasks.

Contribution

It proposes a diffusion strategy-based high-level policy for generating subgoals and integrates state-goal value functions into an off-policy framework, addressing the challenge of learning from non-expert data.

Findings

01

Significant performance improvements over existing methods.

02

Robustness to observation data corruptions.

03

Effective exploration in complex robotic tasks.

Abstract

In this work, we address the challenging problem of long-horizon goal-reaching policy learning from non-expert, action-free observation data. Unlike fully labeled expert data, our data is more accessible and avoids the costly process of action labeling. Additionally, compared to online learning, which often involves aimless exploration, our data provides useful guidance for more efficient exploration. To achieve our goal, we propose a novel subgoal guidance learning strategy. The motivation behind this strategy is that long-horizon goals offer limited guidance for efficient exploration and accurate state transition. We develop a diffusion strategy-based high-level policy to generate reasonable subgoals as waypoints, preferring states that more easily lead to the final goal. Additionally, we learn state-goal value functions to encourage efficient subgoal reaching. These two components…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RenMing-Huang/EGR-PO
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Decision-Making and Behavioral Economics · Explainable Artificial Intelligence (XAI)

MethodsDiffusion