Learning Contact-Rich Manipulation Skills with Guided Policy Search
Sergey Levine, Nolan Wagener, Pieter Abbeel

TL;DR
This paper presents a method for robots to learn complex, contact-rich manipulation skills quickly and without demonstrations by unifying multiple learned trajectories into a generalizable policy.
Contribution
It extends a policy search approach to learn dynamic manipulation behaviors with flexible policies, reducing sample needs and automating parameter tuning for real-world robot applications.
Findings
Learns manipulation skills within minutes of interaction.
Successfully performs complex tasks like stacking and screwing.
Produces robust controllers for diverse contact-rich activities.
Abstract
Autonomous learning of object manipulation skills can enable robots to acquire rich behavioral repertoires that scale to the variety of objects found in the real world. However, current motion skill learning methods typically restrict the behavior to a compact, low-dimensional representation, limiting its expressiveness and generality. In this paper, we extend a recently developed policy search method \cite{la-lnnpg-14} and use it to learn a range of dynamic manipulation behaviors with highly general policy representations, without using known models or example demonstrations. Our approach learns a set of trajectories for the desired motion skill by using iteratively refitted time-varying linear models, and then unifies these trajectories into a single control policy that can generalize to new situations. To enable this method to run on a real robot, we introduce several improvements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Human Pose and Action Recognition
