XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings

Am\'elie Royer; Konstantinos Bousmalis; Stephan Gouws; Fred Bertsch,; Inbar Mosseri; Forrester Cole; Kevin Murphy

arXiv:1711.05139·cs.CV·July 11, 2018·Domain Adaptation for Visual Understanding

XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings

Am\'elie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch,, Inbar Mosseri, Forrester Cole, Kevin Murphy

PDF

4 Repos 1 Datasets

TL;DR

XGAN introduces an unsupervised dual adversarial autoencoder framework for semantic style transfer, enabling many-to-many image translation between unpaired image collections while preserving semantic content.

Contribution

It presents a novel unsupervised method combining domain adaptation and semantic consistency for semantic style transfer across unpaired datasets.

Findings

01

Promising qualitative results on face-to-cartoon translation

02

Introduces a new benchmark dataset CartoonSet for semantic style transfer

03

Demonstrates effective preservation of semantic content during translation

Abstract

Style transfer usually refers to the task of applying color and texture information from a specific style image to a given content image while preserving the structure of the latter. Here we tackle the more generic problem of semantic style transfer: given two unpaired collections of images, we aim to learn a mapping between the corpus-level style of each collection, while preserving semantic content shared across the two domains. We introduce XGAN ("Cross-GAN"), a dual adversarial autoencoder, which captures a shared representation of the common domain semantic content in an unsupervised way, while jointly learning the domain-to-domain image translations in both directions. We exploit ideas from the domain adaptation literature and define a semantic consistency loss which encourages the model to preserve semantics in the learned embedding space. We report promising qualitative results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

cgarciae/cartoonset
dataset· 112 dl
112 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.