Autoencoder for Synthetic to Real Generalization: From Simple to More   Complex Scenes

Steve Dias Da Cruz; Bertram Taetz; Thomas Stifter; Didier Stricker

arXiv:2204.00386·cs.CV·April 4, 2022

Autoencoder for Synthetic to Real Generalization: From Simple to More Complex Scenes

Steve Dias Da Cruz, Bertram Taetz, Thomas Stifter, Didier Stricker

PDF

Open Access 1 Repo

TL;DR

This paper proposes an autoencoder-based approach to improve the generalization from synthetic to real images, especially in complex scenes, by using semantic matching sampling techniques to enhance latent space invariance.

Contribution

It introduces a novel sampling method that enhances autoencoder generalization from synthetic to real images, outperforming fine-tuned classifiers on complex scenes.

Findings

01

Pre-trained feature extractors work well on simple scenes.

02

Semantic matching sampling improves real-world generalization.

03

The approach outperforms fine-tuned classification models on complex data.

Abstract

Learning on synthetic data and transferring the resulting properties to their real counterparts is an important challenge for reducing costs and increasing safety in machine learning. In this work, we focus on autoencoder architectures and aim at learning latent space representations that are invariant to inductive biases caused by the domain shift between simulated and real images showing the same scenario. We train on synthetic images only, present approaches to increase generalizability and improve the preservation of the semantics to real datasets of increasing visual complexity. We show that pre-trained feature extractors (e.g. VGG) can be sufficient for generalization on images of lower complexity, but additional improvements are required for visually more complex scenes. To this end, we demonstrate a new sampling technique, which matches semantically important parts of the image,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stevecruz/icpr2022-autoencoder-syn2real
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · AI in cancer detection