Scaling Single Human Demonstrations for Imitation Learning using Generative Foundational Models
Nick Heppert, Minh Quang Nguyen, Abhinav Valada

TL;DR
This paper introduces Real2Gen, a method that uses a single human demonstration to generate unlimited training data in simulation, enabling effective robot manipulation policies with improved success rates and zero-shot real-world deployment.
Contribution
Real2Gen is a novel approach that transfers a single human demonstration into a simulation environment to generate extensive training data for imitation learning.
Findings
26.6% average increase in success rate
Better generalization of policies
Zero-shot real-world deployment achieved
Abstract
Imitation learning is a popular paradigm to teach robots new tasks, but collecting robot demonstrations through teleoperation or kinesthetic teaching is tedious and time-consuming. In contrast, directly demonstrating a task using our human embodiment is much easier and data is available in abundance, yet transfer to the robot can be non-trivial. In this work, we propose Real2Gen to train a manipulation policy from a single human demonstration. Real2Gen extracts required information from the demonstration and transfers it to a simulation environment, where a programmable expert agent can demonstrate the task arbitrarily many times, generating an unlimited amount of data to train a flow matching policy. We evaluate Real2Gen on human demonstrations from three different real-world tasks and compare it to a recent baseline. Real2Gen shows an average increase in the success rate of 26.6% and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI
