Image Generation and Translation with Disentangled Representations
Tobias Hinz, Stefan Wermter

TL;DR
This paper presents a unified model capable of controllable image generation and multi-domain image translation by disentangling image representations into structured and unstructured parts, enabling flexible manipulation with minimal labeled data.
Contribution
It introduces a novel model that combines controllable generation and translation across multiple domains using disentangled representations, reducing the need for multiple models.
Findings
Able to generate images with specified characteristics
Performs effective image-to-image translation across domains
Learns unknown data factors with minimal supervision
Abstract
Generative models have made significant progress in the tasks of modeling complex data distributions such as natural images. The introduction of Generative Adversarial Networks (GANs) and auto-encoders lead to the possibility of training on big data sets in an unsupervised manner. However, for many generative models it is not possible to specify what kind of image should be generated and it is not possible to translate existing images into new images of similar domains. Furthermore, models that can perform image-to-image translation often need distinct models for each domain, making it hard to scale these systems to multiple domain image-to-image translation. We introduce a model that can do both, controllable image generation and image-to-image translation between multiple domains. We split our image representation into two parts encoding unstructured and structured information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
