InstantID: Zero-shot Identity-Preserving Generation in Seconds
Qixun Wang, Xu Bai, Haofan Wang, Zekui Qin, Anthony Chen, Huaxia Li, Xu Tang, and Yao Hu

TL;DR
InstantID introduces a fast, high-fidelity, zero-shot image personalization method using a diffusion model that requires only a single facial image and integrates seamlessly with existing pre-trained models.
Contribution
We propose InstantID, a novel diffusion-based plug-and-play module that enables identity-preserving image generation from a single image without extensive fine-tuning.
Findings
Achieves high face fidelity from a single reference facial image.
Compatible with popular pre-trained diffusion models such as SD1.5 and SDXL.
Personalizes images within seconds and without per-subject fine-tuning, which makes it practical for real-world applications.
Abstract
There has been significant progress in personalized image synthesis with methods such as Textual Inversion, DreamBooth, and LoRA. Yet, their real-world applicability is hindered by high storage demands, lengthy fine-tuning processes, and the need for multiple reference images. Conversely, existing ID embedding-based methods, while requiring only a single forward inference, face challenges: they either necessitate extensive fine-tuning across numerous model parameters, lack compatibility with community pre-trained models, or fail to maintain high face fidelity. Addressing these limitations, we introduce InstantID, a powerful diffusion model-based solution. Our plug-and-play module adeptly handles image personalization in various styles using just a single facial image, while ensuring high fidelity. To achieve this, we design a novel IdentityNet by imposing strong semantic and weak…
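The abstract does not state how "face fidelity" is measured. As an illustrative assumption only, a common proxy is the cosine similarity between a face-recognition embedding of the reference image and one of the generated image. The sketch below uses hand-written toy vectors in place of real embeddings; the function name and all values are hypothetical and are not taken from the paper.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine similarity of two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


# Toy 4-dimensional vectors standing in for real face embeddings
# (assumption: real embeddings would come from a face-recognition model).
reference_id = [0.9, 0.1, 0.3, 0.2]
generated_same = [0.8, 0.2, 0.3, 0.1]    # meant to resemble the reference
generated_other = [-0.2, 0.9, -0.4, 0.1]  # meant to be a different identity

same_score = cosine_similarity(reference_id, generated_same)
other_score = cosine_similarity(reference_id, generated_other)
print(f"same identity: {same_score:.3f}, other identity: {other_score:.3f}")
```

Under this assumed metric, a higher score for the generated image against its reference would indicate better identity preservation.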
Code & Models
- 🤗 InstantX/InstantID · model · 42k dl · ♡ 848
- 🤗 JCTN/InstantID · model · 52 dl · ♡ 1
- 🤗 Gokuldaskumar/instantID · model · 1 dl
- 🤗 krnl/InstantID · model · 48 dl
- 🤗 ModelsLab/InstantID · model · 155 dl
- 🤗 Norman-ou/GeoPix-ft-sior_rsicap · model · 53 dl · ♡ 1
- 🤗 shangguanyanyan/InstantID-custom · model · 1 dl
- 🤗 KarthikAI/InstantID-img2img · model
- 🤗 feixiastar/InstantID · model
- 🤗 kristian1515/my-private-model · model · 9 dl
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Face recognition and analysis · AI in cancer detection
Methods: Diffusion
