SoloGAN: Multi-domain Multimodal Unpaired Image-to-Image Translation via a Single Generative Adversarial Network
Shihua Huang, Cheng He, Ran Cheng

TL;DR
SoloGAN introduces a unified single-generator model for multi-domain, multimodal unpaired image-to-image translation, effectively learning domain-invariant features from all domains to produce diverse, high-quality translated images.
Contribution
The paper proposes a novel SoloGAN framework that uses a single generator and discriminator with shared encoders for multiple domains, improving efficiency and translation quality over existing methods.
Findings
Effective translation across multiple domains with a single model
Superior performance on datasets with shape variations and complex backgrounds
Component analysis confirms the importance of each part of SoloGAN
Abstract
Despite significant advances in image-to-image (I2I) translation with generative adversarial networks (GANs), it remains challenging to effectively translate an image to a set of diverse images in multiple target domains using a single pair of generator and discriminator. Existing I2I translation methods adopt multiple domain-specific content encoders for different domains, where each domain-specific content encoder is trained with images from the same domain only. Nevertheless, we argue that the content (domain-invariance) features should be learned from images among all of the domains. Consequently, each domain-specific content encoder of existing schemes fails to extract the domain-invariant features efficiently. To address this issue, we present a flexible and general SoloGAN model for efficient multimodal I2I translation among multiple domains with unpaired data. In contrast to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer-related molecular mechanisms research · Digital Media Forensic Detection · Image Processing Techniques and Applications
MethodsProjection Discriminator
