Image Generation and Translation with Disentangled Representations

Tobias Hinz; Stefan Wermter

arXiv:1803.10567·cs.CV·March 29, 2018

Image Generation and Translation with Disentangled Representations

Tobias Hinz, Stefan Wermter

PDF

TL;DR

This paper presents a unified model capable of controllable image generation and multi-domain image translation by disentangling image representations into structured and unstructured parts, enabling flexible manipulation with minimal labeled data.

Contribution

It introduces a novel model that combines controllable generation and translation across multiple domains using disentangled representations, reducing the need for multiple models.

Findings

01

Able to generate images with specified characteristics

02

Performs effective image-to-image translation across domains

03

Learns unknown data factors with minimal supervision

Abstract

Generative models have made significant progress in the tasks of modeling complex data distributions such as natural images. The introduction of Generative Adversarial Networks (GANs) and auto-encoders lead to the possibility of training on big data sets in an unsupervised manner. However, for many generative models it is not possible to specify what kind of image should be generated and it is not possible to translate existing images into new images of similar domains. Furthermore, models that can perform image-to-image translation often need distinct models for each domain, making it hard to scale these systems to multiple domain image-to-image translation. We introduce a model that can do both, controllable image generation and image-to-image translation between multiple domains. We split our image representation into two parts encoding unstructured and structured information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.