SoloGAN: Multi-domain Multimodal Unpaired Image-to-Image Translation via   a Single Generative Adversarial Network

Shihua Huang; Cheng He; Ran Cheng

arXiv:2008.01681·cs.CV·June 30, 2022

SoloGAN: Multi-domain Multimodal Unpaired Image-to-Image Translation via a Single Generative Adversarial Network

Shihua Huang, Cheng He, Ran Cheng

PDF

Open Access

TL;DR

SoloGAN introduces a unified single-generator model for multi-domain, multimodal unpaired image-to-image translation, effectively learning domain-invariant features from all domains to produce diverse, high-quality translated images.

Contribution

The paper proposes a novel SoloGAN framework that uses a single generator and discriminator with shared encoders for multiple domains, improving efficiency and translation quality over existing methods.

Findings

01

Effective translation across multiple domains with a single model

02

Superior performance on datasets with shape variations and complex backgrounds

03

Component analysis confirms the importance of each part of SoloGAN

Abstract

Despite significant advances in image-to-image (I2I) translation with generative adversarial networks (GANs), it remains challenging to effectively translate an image to a set of diverse images in multiple target domains using a single pair of generator and discriminator. Existing I2I translation methods adopt multiple domain-specific content encoders for different domains, where each domain-specific content encoder is trained with images from the same domain only. Nevertheless, we argue that the content (domain-invariance) features should be learned from images among all of the domains. Consequently, each domain-specific content encoder of existing schemes fails to extract the domain-invariant features efficiently. To address this issue, we present a flexible and general SoloGAN model for efficient multimodal I2I translation among multiple domains with unpaired data. In contrast to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCancer-related molecular mechanisms research · Digital Media Forensic Detection · Image Processing Techniques and Applications

MethodsProjection Discriminator