Synthetic-to-Real Domain Adaptation using Contrastive Unpaired Translation
Benedikt T. Imbusch, Max Schwarz, Sven Behnke

TL;DR
This paper introduces a multi-step approach combining synthetic data generation and contrastive unpaired image translation to reduce domain gap, enabling effective training of deep models for robotics without manual annotation.
Contribution
It presents a novel pipeline that synthesizes data from 3D meshes and adapts it to real images using contrastive unpaired translation, improving domain adaptation in robotics.
Findings
Enhanced synthetic-to-real translation quality
Reduced training time with patch-based methods
Improved performance on robotic datasets
Abstract
The usefulness of deep learning models in robotics is largely dependent on the availability of training data. Manual annotation of training data is often infeasible. Synthetic data is a viable alternative, but suffers from domain gap. We propose a multi-step method to obtain training data without manual annotation effort: From 3D object meshes, we generate images using a modern synthesis pipeline. We utilize a state-of-the-art image-to-image translation method to adapt the synthetic images to the real domain, minimizing the domain gap in a learned manner. The translation network is trained from unpaired images, i.e. just requires an un-annotated collection of real images. The generated and refined images can then be used to train deep learning models for a particular task. We also propose and evaluate extensions to the translation method that further increase performance, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
