Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation
Yifeng Zhu, Peter Stone, Yuke Zhu

TL;DR
This paper introduces a bottom-up skill discovery approach from unsegmented demonstrations, enabling robots to learn and compose reusable skills for long-horizon manipulation tasks efficiently.
Contribution
It presents a hierarchical, agglomerative clustering-based method to learn skills from unsegmented demonstrations and trains a meta-controller for skill composition, requiring minimal human annotation.
Findings
Outperforms state-of-the-art imitation learning methods in simulation and real robot tasks.
Skills from multi-task demonstrations improve success rates by 8%.
Method trains on small datasets within 30 minutes, suitable for real-world use.
Abstract
We tackle real-world long-horizon robot manipulation tasks through skill discovery. We present a bottom-up approach to learning a library of reusable skills from unsegmented demonstrations and use these skills to synthesize prolonged robot behaviors. Our method starts with constructing a hierarchical task structure from each demonstration through agglomerative clustering. From the task structures of multi-task demonstrations, we identify skills based on the recurring patterns and train goal-conditioned sensorimotor policies with hierarchical imitation learning. Finally, we train a meta controller to compose these skills to solve long-horizon manipulation tasks. The entire model can be trained on a small set of human demonstrations collected within 30 minutes without further annotations, making it amendable to real-world deployment. We systematically evaluated our method in simulation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Human Pose and Action Recognition
