HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation
Abdul Basit Anees, Ahmet Canberk Baykal, Muhammed Burak Kizil, Duygu, Ceylan, Erkut Erdem, Aykut Erdem

TL;DR
HyperGAN-CLIP introduces a versatile framework that extends StyleGAN with CLIP integration, enabling domain adaptation, image synthesis, and manipulation driven by reference images or text, with improved quality and flexibility.
Contribution
The paper presents a novel hypernetwork-based extension of StyleGAN that incorporates CLIP space, allowing dynamic domain adaptation and text-guided image manipulation without additional training.
Findings
Outperforms existing methods in domain adaptation and image synthesis.
Enables text-guided image manipulation without text-specific training.
Achieves superior image quality and style transfer flexibility.
Abstract
Generative Adversarial Networks (GANs), particularly StyleGAN and its variants, have demonstrated remarkable capabilities in generating highly realistic images. Despite their success, adapting these models to diverse tasks such as domain adaptation, reference-guided synthesis, and text-guided manipulation with limited training data remains challenging. Towards this end, in this study, we present a novel framework that significantly extends the capabilities of a pre-trained StyleGAN by integrating CLIP space via hypernetworks. This integration allows dynamic adaptation of StyleGAN to new domains defined by reference images or textual descriptions. Additionally, we introduce a CLIP-guided discriminator that enhances the alignment between generated images and target domains, ensuring superior image quality. Our approach demonstrates unprecedented flexibility, enabling text-guided image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
MethodsDense Connections · R1 Regularization · HuMan(Expedia)||How do I get a human at Expedia? · Convolution · Feedforward Network · Adaptive Instance Normalization · Contrastive Language-Image Pre-training · StyleGAN
