XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings
Am\'elie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch,, Inbar Mosseri, Forrester Cole, Kevin Murphy

TL;DR
XGAN introduces an unsupervised dual adversarial autoencoder framework for semantic style transfer, enabling many-to-many image translation between unpaired image collections while preserving semantic content.
Contribution
It presents a novel unsupervised method combining domain adaptation and semantic consistency for semantic style transfer across unpaired datasets.
Findings
Promising qualitative results on face-to-cartoon translation
Introduces a new benchmark dataset CartoonSet for semantic style transfer
Demonstrates effective preservation of semantic content during translation
Abstract
Style transfer usually refers to the task of applying color and texture information from a specific style image to a given content image while preserving the structure of the latter. Here we tackle the more generic problem of semantic style transfer: given two unpaired collections of images, we aim to learn a mapping between the corpus-level style of each collection, while preserving semantic content shared across the two domains. We introduce XGAN ("Cross-GAN"), a dual adversarial autoencoder, which captures a shared representation of the common domain semantic content in an unsupervised way, while jointly learning the domain-to-domain image translations in both directions. We exploit ideas from the domain adaptation literature and define a semantic consistency loss which encourages the model to preserve semantics in the learned embedding space. We report promising qualitative results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
