Robust Visual Sim-to-Real Transfer for Robotic Manipulation
Ricardo Garcia, Robin Strudel, Shizhe Chen, Etienne Arlaud, and Ivan Laptev, Cordelia Schmid

TL;DR
This paper presents a systematic study of visual domain randomization for robotic manipulation, demonstrating that off-line optimized DR parameters enable high success rates in sim-to-real transfer of visuomotor policies.
Contribution
The authors propose an off-line proxy task for selecting DR parameters and show its effectiveness in training policies that transfer successfully to real robots, outperforming real-data-only policies.
Findings
Achieved 93% success rate on diverse manipulation tasks.
Off-line proxy task effectively guides DR parameter selection.
Policies trained with DR outperform real-data-only policies in robustness.
Abstract
Learning visuomotor policies in simulation is much safer and cheaper than in the real world. However, due to discrepancies between the simulated and real data, simulator-trained policies often fail when transferred to real robots. One common approach to bridge the visual sim-to-real domain gap is domain randomization (DR). While previous work mainly evaluates DR for disembodied tasks, such as pose estimation and object detection, here we systematically explore visual domain randomization methods and benchmark them on a rich set of challenging robotic manipulation tasks. In particular, we propose an off-line proxy task of cube localization to select DR parameters for texture randomization, lighting randomization, variations of object colors and camera parameters. Notably, we demonstrate that DR parameters have similar impact on our off-line proxy task and on-line policies. We, hence, use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
Methodsfail
