Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations
Seogkyu Jeon, Bei Liu, Pilhyeon Lee, Kibeom Hong, Jianlong Fu, Hyeran, Byun

TL;DR
This paper introduces a novel approach to enhance diversity in zero-shot GAN adaptation by leveraging semantic variations in CLIP space, resulting in improved sample diversity and quality without additional training data.
Contribution
It proposes a method to find semantic variations in CLIP space and introduces a directional moment loss, elastic weight consolidation, and relation consistency loss to improve diversity and content preservation.
Findings
Achieves state-of-the-art diversity and quality in zero-shot GAN adaptation.
Effectively maintains source domain content while enhancing diversity.
Demonstrates robustness across various adaptation scenarios.
Abstract
Training deep generative models usually requires a large amount of data. To alleviate the data collection cost, the task of zero-shot GAN adaptation aims to reuse well-trained generators to synthesize images of an unseen target domain without any further training samples. Due to the data absence, the textual description of the target domain and the vision-language models, e.g., CLIP, are utilized to effectively guide the generator. However, with only a single representative text feature instead of real images, the synthesized images gradually lose diversity as the model is optimized, which is also known as mode collapse. To tackle the problem, we propose a novel method to find semantic variations of the target text in the CLIP space. Specifically, we explore diverse semantic variations based on the informative text feature of the target domain while regularizing the uncontrolled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations· youtube
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Computational and Text Analysis Methods
MethodsContrastive Language-Image Pre-training
