Visual Imitation Made Easy
Sarah Young, Dhiraj Gandhi, Shubham Tulsiani, Abhinav Gupta, Pieter, Abbeel, Lerrel Pinto

TL;DR
This paper introduces a simplified visual imitation learning interface using assistive tools and SfM techniques, enabling efficient data collection and successful policy learning for complex manipulation tasks with robots.
Contribution
It presents a novel, easy-to-use imitation data collection method with off-the-shelf tools, facilitating large-scale demonstrations and transfer to real robots.
Findings
Achieved 87% success in pushing tasks.
Achieved 62% success in stacking tasks.
Demonstrated effective policy learning from diverse offline demonstrations.
Abstract
Visual imitation learning provides a framework for learning complex manipulation behaviors by leveraging human demonstrations. However, current interfaces for imitation such as kinesthetic teaching or teleoperation prohibitively restrict our ability to efficiently collect large-scale data in the wild. Obtaining such diverse demonstration data is paramount for the generalization of learned skills to novel scenarios. In this work, we present an alternate interface for imitation that simplifies the data collection process while allowing for easy transfer to robots. We use commercially available reacher-grabber assistive tools both as a data collection device and as the robot's end-effector. To extract action information from these visual demonstrations, we use off-the-shelf Structure from Motion (SfM) techniques in addition to training a finger detection network. We experimentally evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Multimodal Machine Learning Applications
