One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning
Tianhe Yu, Chelsea Finn, Annie Xie, Sudeep Dasari, Tianhao Zhang,, Pieter Abbeel, Sergey Levine

TL;DR
This paper introduces a meta-learning approach enabling robots to learn new manipulation tasks from a single human demonstration video, despite domain differences, by leveraging prior knowledge from previous tasks.
Contribution
It presents a domain-adaptive meta-learning method that allows robots to perform one-shot imitation learning directly from raw human videos without explicit pose detection.
Findings
Robots successfully learned to place, push, and pick-and-place objects from a single human video.
Meta-learning improved the robot's ability to generalize across different tasks and domain shifts.
Experiments on PR2 and Sawyer arms demonstrated practical applicability.
Abstract
Humans and animals are capable of learning a new behavior by observing others perform the skill just once. We consider the problem of allowing a robot to do the same -- learning from a raw video pixels of a human, even when there is substantial domain shift in the perspective, environment, and embodiment between the robot and the observed human. Prior approaches to this problem have hand-specified how human and robot actions correspond and often relied on explicit human pose detection systems. In this work, we present an approach for one-shot learning from a video of a human by using human and robot demonstration data from a variety of previous tasks to build up prior knowledge through meta-learning. Then, combining this prior knowledge and only a single video demonstration from a human, the robot can perform the task that the human demonstrated. We show experiments on both a PR2 arm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning
