When StyleGAN Meets Stable Diffusion: a $\mathscr{W}_+$ Adapter for Personalized Image Generation
Xiaoming Li, Xinyu Hou, Chen Change Loy

TL;DR
This paper introduces a novel method using the StyleGAN $\mathscr{W}_+$ space to improve personalized image generation with diffusion models, achieving better identity preservation and disentanglement for facial images.
Contribution
It proposes a new approach that aligns StyleGAN's $\mathscr{W}_+$ space with diffusion models, enhancing identity fidelity and semantic editing capabilities in generated images.
Findings
Enhanced identity preservation in generated faces.
Improved disentanglement of facial attributes from background.
Compatibility with prompt descriptions and StyleGAN editing directions.
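The last finding refers to StyleGAN editing directions, which are commonly applied as a linear shift of a latent code. The snippet below is a minimal sketch of that general idea, not the paper's implementation. It assumes an 18 x 512 latent layout (the usual one for a 1024 x 1024 StyleGAN). The helper `edit_w_plus`, the random stand-in latent, and the "smile" direction are all hypothetical and only illustrate the arithmetic.

```python
import numpy as np

NUM_LAYERS = 18   # assumption: one style vector per synthesis layer at 1024x1024
STYLE_DIM = 512   # assumption: dimensionality of each style vector


def edit_w_plus(w_plus, direction, strength):
    """Shift a W+ code along a semantic direction (hypothetical helper).

    w_plus:    array of shape (NUM_LAYERS, STYLE_DIM)
    direction: array of shape (STYLE_DIM,) or (NUM_LAYERS, STYLE_DIM)
    strength:  scalar step size; the sign picks the edit direction
    """
    w_plus = np.asarray(w_plus, dtype=np.float32)
    direction = np.asarray(direction, dtype=np.float32)
    if w_plus.shape != (NUM_LAYERS, STYLE_DIM):
        raise ValueError(f"expected shape {(NUM_LAYERS, STYLE_DIM)}, got {w_plus.shape}")
    # A (STYLE_DIM,) direction broadcasts to every layer;
    # a (NUM_LAYERS, STYLE_DIM) direction is applied layer by layer.
    return w_plus + strength * direction


rng = np.random.default_rng(0)
# Stand-in for a latent code that a face encoder would normally produce.
w = rng.standard_normal((NUM_LAYERS, STYLE_DIM)).astype(np.float32)
# Stand-in for a learned attribute direction, normalized to unit length.
smile = rng.standard_normal(STYLE_DIM).astype(np.float32)
smile /= np.linalg.norm(smile)

w_edited = edit_w_plus(w, smile, strength=2.0)
```

In a pipeline of the kind the summary describes, the edited code rather than the original one would be passed on as the identity condition. How that conditioning is wired into the diffusion model is not covered by this sketch.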
Abstract
Text-to-image diffusion models have remarkably excelled in producing diverse, high-quality, and photo-realistic images. This advancement has spurred a growing interest in incorporating specific identities into generated content. Most current methods employ an inversion approach to embed a target visual concept into the text embedding space using a single reference image. However, the newly synthesized faces either closely resemble the reference image in terms of facial attributes, such as expression, or exhibit a reduced capacity for identity preservation. Text descriptions intended to guide the facial attributes of the synthesized face may fall short, owing to the intricate entanglement of identity information with identity-irrelevant facial attributes derived from the reference image. To address these issues, we present the novel use of the extended StyleGAN embedding space…
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Face recognition and analysis · Computational and Text Analysis Methods
Methods: Dense Connections · Feedforward Network · Adaptive Instance Normalization · R1 Regularization · Convolution · Diffusion · StyleGAN
