FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao, Tianwei Yin, William T. Freeman, Fr\'edo Durand, Song, Han

TL;DR
FastComposer enables efficient, personalized multi-subject image generation using diffusion models without fine-tuning, employing subject embeddings and attention supervision to produce high-quality images rapidly.
Contribution
It introduces a tuning-free method for multi-subject image generation that localizes attention and delays subject conditioning, improving efficiency and personalization.
Findings
Achieves 300-2500x speedup over fine-tuning methods.
Generates images of multiple unseen individuals with diverse styles and contexts.
Requires no extra storage for new subjects.
Abstract
Diffusion models excel at text-to-image generation, especially in subject-driven generation for personalized images. However, existing methods are inefficient due to the subject-specific fine-tuning, which is computationally intensive and hampers efficient deployment. Moreover, existing methods struggle with multi-subject generation as they often blend features among subjects. We present FastComposer which enables efficient, personalized, multi-subject text-to-image generation without fine-tuning. FastComposer uses subject embeddings extracted by an image encoder to augment the generic text conditioning in diffusion models, enabling personalized image generation based on subject images and textual instructions with only forward passes. To address the identity blending problem in the multi-subject generation, FastComposer proposes cross-attention localization supervision during training,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Advanced Neural Network Applications
MethodsDiffusion
