HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image   Synthesis and Manipulation

Abdul Basit Anees; Ahmet Canberk Baykal; Muhammed Burak Kizil; Duygu; Ceylan; Erkut Erdem; Aykut Erdem

arXiv:2411.12832·cs.CV·November 21, 2024

HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation

Abdul Basit Anees, Ahmet Canberk Baykal, Muhammed Burak Kizil, Duygu, Ceylan, Erkut Erdem, Aykut Erdem

PDF

Open Access 1 Repo

TL;DR

HyperGAN-CLIP introduces a versatile framework that extends StyleGAN with CLIP integration, enabling domain adaptation, image synthesis, and manipulation driven by reference images or text, with improved quality and flexibility.

Contribution

The paper presents a novel hypernetwork-based extension of StyleGAN that incorporates CLIP space, allowing dynamic domain adaptation and text-guided image manipulation without additional training.

Findings

01

Outperforms existing methods in domain adaptation and image synthesis.

02

Enables text-guided image manipulation without text-specific training.

03

Achieves superior image quality and style transfer flexibility.

Abstract

Generative Adversarial Networks (GANs), particularly StyleGAN and its variants, have demonstrated remarkable capabilities in generating highly realistic images. Despite their success, adapting these models to diverse tasks such as domain adaptation, reference-guided synthesis, and text-guided manipulation with limited training data remains challenging. Towards this end, in this study, we present a novel framework that significantly extends the capabilities of a pre-trained StyleGAN by integrating CLIP space via hypernetworks. This integration allows dynamic adaptation of StyleGAN to new domains defined by reference images or textual descriptions. Additionally, we introduce a CLIP-guided discriminator that enhances the alignment between generated images and target domains, ensuring superior image quality. Our approach demonstrates unprecedented flexibility, enabling text-guided image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adobe-research/domain-expansion
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis

MethodsDense Connections · R1 Regularization · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · Feedforward Network · Adaptive Instance Normalization · Contrastive Language-Image Pre-training · StyleGAN