Towards Open-World Text-Guided Face Image Generation and Manipulation
Weihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu

TL;DR
This paper introduces a high-resolution, open-world text-guided face image generation and manipulation framework that leverages pretrained GANs and language models, enabling diverse, high-quality outputs without re-training.
Contribution
It presents a novel paradigm combining pretrained GANs and language models for open-world, multi-modal face image synthesis and editing at 1024 resolution.
Findings
Achieves 1024 resolution face image generation and manipulation.
Supports open-world scenarios with no re-training or fine-tuning.
Outperforms existing methods on a new multi-modal CelebA-HQ dataset.
Abstract
The existing text-guided image synthesis methods can only produce limited quality results with at most \mbox{} resolution and the textual instructions are constrained in a small Corpus. In this work, we propose a unified framework for both face image generation and manipulation that produces diverse and high-quality images with an unprecedented resolution at 1024 from multimodal inputs. More importantly, our method supports open-world scenarios, including both image and text, without any re-training, fine-tuning, or post-processing. To be specific, we propose a brand new paradigm of text-guided image generation and manipulation based on the superior characteristics of a pretrained GAN model. Our proposed paradigm includes two novel strategies. The first strategy is to train a text encoder to obtain latent codes that align with the hierarchically semantic of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Multimodal Machine Learning Applications
