Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen, Zhao Mandi, Homanga Bharadhwaj, Mohit Sharma, Shuran Song,, Abhishek Gupta, Vikash Kumar

TL;DR
This paper introduces a novel data augmentation framework using pre-trained image-text generative models to improve robot manipulation policy generalization in unseen real-world environments, reducing the need for extensive real-world data collection.
Contribution
It presents a semantically controllable augmentation method leveraging generative models to synthesize diverse training data, enhancing robot learning generalization without additional data collection costs.
Findings
Enhanced policy generalization in unseen environments
Effective augmentation of robot datasets with synthetic data
Scalable approach applicable to various robotic tasks
Abstract
Generalization to unseen real-world scenarios for robot manipulation requires exposure to diverse datasets during training. However, collecting large real-world datasets is intractable due to high operational costs. For robot learning to generalize despite these challenges, it is essential to leverage sources of data or priors beyond the robot's direct experience. In this work, we posit that image-text generative models, which are pre-trained on large corpora of web-scraped data, can serve as such a data source. These generative models encompass a broad range of real-world scenarios beyond a robot's direct experience and can synthesize novel synthetic experiences that expose robotic agents to additional world priors aiding real-world generalization at no extra cost. In particular, our approach leverages pre-trained generative models as an effective tool for data augmentation. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Robot Manipulation and Learning · Machine Learning and Algorithms
