Bottom-Up Skill Discovery from Unsegmented Demonstrations for   Long-Horizon Robot Manipulation

Yifeng Zhu; Peter Stone; Yuke Zhu

arXiv:2109.13841·cs.RO·January 25, 2022

Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation

Yifeng Zhu, Peter Stone, Yuke Zhu

PDF

Open Access 1 Datasets

TL;DR

This paper introduces a bottom-up skill discovery approach from unsegmented demonstrations, enabling robots to learn and compose reusable skills for long-horizon manipulation tasks efficiently.

Contribution

It presents a hierarchical, agglomerative clustering-based method to learn skills from unsegmented demonstrations and trains a meta-controller for skill composition, requiring minimal human annotation.

Findings

01

Outperforms state-of-the-art imitation learning methods in simulation and real robot tasks.

02

Skills from multi-task demonstrations improve success rates by 8%.

03

Method trains on small datasets within 30 minutes, suitable for real-world use.

Abstract

We tackle real-world long-horizon robot manipulation tasks through skill discovery. We present a bottom-up approach to learning a library of reusable skills from unsegmented demonstrations and use these skills to synthesize prolonged robot behaviors. Our method starts with constructing a hierarchical task structure from each demonstration through agglomerative clustering. From the task structures of multi-task demonstrations, we identify skills based on the recurring patterns and train goal-conditioned sensorimotor policies with hierarchical imitation learning. Finally, we train a meta controller to compose these skills to solve long-horizon manipulation tasks. The entire model can be trained on a small set of human demonstrations collected within 30 minutes without further annotations, making it amendable to real-world deployment. We systematically evaluated our method in simulation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

lerobot/austin_buds_dataset
dataset· 616 dl
616 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Human Pose and Action Recognition