XOGAN: One-to-Many Unsupervised Image-to-Image Translation
Yongqi Zhang

TL;DR
XOGAN is a novel unsupervised image translation model that learns one-to-many mappings, allowing for diverse and controllable image generation across domains without paired data.
Contribution
Introduces the XOGAN model with an additional variable and XO-structure to enable one-to-many unsupervised image translation with controllable variations.
Findings
XOGAN generates diverse, plausible images with controllable variations.
It outperforms state-of-the-art methods in diversity of generated images.
Effective on edges-to-objects and facial image translation tasks.
Abstract
Unsupervised image-to-image translation aims at learning the relationship between samples from two image domains without supervised pair information. The relationship between two domain images can be one-to-one, one-to-many or many-to-many. In this paper, we study the one-to-many unsupervised image translation problem in which an input sample from one domain can correspond to multiple samples in the other domain. To learn the complex relationship between the two domains, we introduce an additional variable to control the variations in our one-to-many mapping. A generative model with an XO-structure, called the XOGAN, is proposed to learn the cross domain relationship among the two domains and the ad- ditional variables. Not only can we learn to translate between the two image domains, we can also handle the translated images with additional variations. Experiments are performed on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Cancer-related molecular mechanisms research · Advanced Image Processing Techniques
