Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations

Seogkyu Jeon; Bei Liu; Pilhyeon Lee; Kibeom Hong; Jianlong Fu; Hyeran; Byun

arXiv:2308.10554·cs.CV·August 22, 2023

Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations

Seogkyu Jeon, Bei Liu, Pilhyeon Lee, Kibeom Hong, Jianlong Fu, Hyeran, Byun

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel approach to enhance diversity in zero-shot GAN adaptation by leveraging semantic variations in CLIP space, resulting in improved sample diversity and quality without additional training data.

Contribution

It proposes a method to find semantic variations in CLIP space and introduces a directional moment loss, elastic weight consolidation, and relation consistency loss to improve diversity and content preservation.

Findings

01

Achieves state-of-the-art diversity and quality in zero-shot GAN adaptation.

02

Effectively maintains source domain content while enhancing diversity.

03

Demonstrates robustness across various adaptation scenarios.

Abstract

Training deep generative models usually requires a large amount of data. To alleviate the data collection cost, the task of zero-shot GAN adaptation aims to reuse well-trained generators to synthesize images of an unseen target domain without any further training samples. Due to the data absence, the textual description of the target domain and the vision-language models, e.g., CLIP, are utilized to effectively guide the generator. However, with only a single representative text feature instead of real images, the synthesized images gradually lose diversity as the model is optimized, which is also known as mode collapse. To tackle the problem, we propose a novel method to find semantic variations of the target text in the CLIP space. Specifically, we explore diverse semantic variations based on the informative text feature of the target domain while regularizing the uncontrolled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Computational and Text Analysis Methods

MethodsContrastive Language-Image Pre-training