Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin, Pieter Abbeel

TL;DR
The paper introduces GSR, a novel offline imitation learning algorithm that organizes interaction data into a graph, enabling effective learning from suboptimal demonstrations for complex robotic manipulation tasks.
Contribution
GSR combines graph search and retrieval with behavior cloning to improve learning stability and success rates from suboptimal data in robotic manipulation.
Findings
GSR achieves 10-30% higher success rates than baselines.
GSR demonstrates effectiveness on complex visual manipulation tasks.
GSR improves proficiency by over 30% in various experiments.
Abstract
Imitation learning is a powerful machine learning algorithm for a robot to acquire manipulation skills. Nevertheless, many real-world manipulation tasks involve precise and dexterous robot-object interactions, which make it difficult for humans to collect high-quality expert demonstrations. As a result, a robot has to learn skills from suboptimal demonstrations and unstructured interactions, which remains a key challenge. Existing works typically use offline deep reinforcement learning (RL) to solve this challenge, but in practice these algorithms are unstable and fragile due to the deadly triad issue. To overcome this problem, we propose GSR, a simple yet effective algorithm that learns from suboptimal demonstrations through Graph Search and Retrieval. We first use pretrained representation to organize the interaction experience into a graph and perform a graph search to calculate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling
