Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface
Yujie Zhao, Hongwei Fan, Di Chen, Shengcong Chen, Liliang Chen, Xiaoqi Li, Guanghui Ren, Hao Dong

TL;DR
Real2Edit2Real is a framework that creates diverse robotic demonstrations by editing 3D scene geometry and synthesizing manipulation videos, significantly reducing data collection costs and enhancing policy robustness.
Contribution
The paper introduces a novel 3D editing and video synthesis framework that generates diverse manipulation demonstrations from minimal source data.
Findings
Policies trained on 1-5 generated demonstrations outperform those trained on 50 real demonstrations.
The framework achieves 10-50x data efficiency improvement.
Demonstrates flexibility in height and texture editing for data augmentation.
Abstract
Recent progress in robot learning has been driven by large-scale datasets and powerful visuomotor policy architectures, yet policy robustness remains limited by the substantial cost of collecting diverse demonstrations, particularly for spatial generalization in manipulation tasks. To reduce repetitive data collection, we present Real2Edit2Real, a framework that generates new demonstrations by bridging 3D editability with 2D visual data through a 3D control interface. Our approach first reconstructs scene geometry from multi-view RGB observations with a metric-scale 3D reconstruction model. Based on the reconstructed geometry, we perform depth-reliable 3D editing on point clouds to generate new manipulation trajectories while geometrically correcting the robot poses to recover physically consistent depth, which serves as a reliable condition for synthesizing new demonstrations. Finally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
