DemoGen: Synthetic Demonstration Generation for Data-Efficient   Visuomotor Policy Learning

Zhengrong Xue; Shuying Deng; Zhenyang Chen; Yixuan Wang; Zhecheng; Yuan; Huazhe Xu

arXiv:2502.16932·cs.RO·February 25, 2025

DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning

Zhengrong Xue, Shuying Deng, Zhenyang Chen, Yixuan Wang, Zhecheng, Yuan, Huazhe Xu

PDF

Open Access

TL;DR

DemoGen is a synthetic demonstration generation method that improves visuomotor policy learning by augmenting limited human demonstrations with spatially and visually diverse synthetic data, enhancing generalization and robustness.

Contribution

DemoGen introduces a fully synthetic, low-cost approach for automatic demonstration generation that significantly boosts policy performance with minimal human data.

Findings

01

Enhances policy performance across various manipulation tasks

02

Effective with only one human demonstration per task

03

Enables out-of-distribution capabilities like obstacle avoidance

Abstract

Visuomotor policies have shown great promise in robotic manipulation but often require substantial amounts of human-collected data for effective performance. A key reason underlying the data demands is their limited spatial generalization capability, which necessitates extensive data collection across different object configurations. In this work, we present DemoGen, a low-cost, fully synthetic approach for automatic demonstration generation. Using only one human-collected demonstration per task, DemoGen generates spatially augmented demonstrations by adapting the demonstrated action trajectory to novel object configurations. Visual observations are synthesized by leveraging 3D point clouds as the modality and rearranging the subjects in the scene via 3D editing. Empirically, DemoGen significantly enhances policy performance across a diverse range of real-world manipulation tasks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Human Pose and Action Recognition