Offline Imitation Learning Through Graph Search and Retrieval

Zhao-Heng Yin; Pieter Abbeel

arXiv:2407.15403·cs.RO·July 23, 2024

Offline Imitation Learning Through Graph Search and Retrieval

Zhao-Heng Yin, Pieter Abbeel

PDF

Open Access

TL;DR

The paper introduces GSR, a novel offline imitation learning algorithm that organizes interaction data into a graph, enabling effective learning from suboptimal demonstrations for complex robotic manipulation tasks.

Contribution

GSR combines graph search and retrieval with behavior cloning to improve learning stability and success rates from suboptimal data in robotic manipulation.

Findings

01

GSR achieves 10-30% higher success rates than baselines.

02

GSR demonstrates effectiveness on complex visual manipulation tasks.

03

GSR improves proficiency by over 30% in various experiments.

Abstract

Imitation learning is a powerful machine learning algorithm for a robot to acquire manipulation skills. Nevertheless, many real-world manipulation tasks involve precise and dexterous robot-object interactions, which make it difficult for humans to collect high-quality expert demonstrations. As a result, a robot has to learn skills from suboptimal demonstrations and unstructured interactions, which remains a key challenge. Existing works typically use offline deep reinforcement learning (RL) to solve this challenge, but in practice these algorithms are unstable and fragile due to the deadly triad issue. To overcome this problem, we propose GSR, a simple yet effective algorithm that learns from suboptimal demonstrations through Graph Search and Retrieval. We first use pretrained representation to organize the interaction experience into a graph and perform a graph search to calculate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling