Key-Locked Rank One Editing for Text-to-Image Personalization
Yoad Tewel, Rinon Gal, Gal Chechik, Yuval Atzmon

TL;DR
Perfusion introduces a novel, small, and efficient method for personalized text-to-image generation that maintains high fidelity and allows flexible concept combination through key-locking and gated rank-1 updates.
Contribution
The paper proposes Perfusion, a new T2I personalization technique using dynamic rank-1 updates with key-locking, enabling efficient, flexible, and high-quality concept integration with minimal model size.
Findings
Perfusion outperforms strong baselines in qualitative and quantitative evaluations.
Key-locking allows unprecedented personalization of object interactions.
The method achieves high visual fidelity with a 100KB model, far smaller than existing approaches.
Abstract
Text-to-image models (T2I) offer a new level of flexibility by allowing users to guide the creative process through natural language. However, personalizing these models to align with user-provided visual concepts remains a challenging problem. The task of T2I personalization poses multiple hard challenges, such as maintaining high visual fidelity while allowing creative control, combining multiple personalized concepts in a single image, and keeping a small model size. We present Perfusion, a T2I personalization method that addresses these challenges using dynamic rank-1 updates to the underlying T2I model. Perfusion avoids overfitting by introducing a new mechanism that "locks" new concepts' cross-attention Keys to their superordinate category. Additionally, we develop a gated rank-1 approach that enables us to control the influence of a learned concept during inference time and to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Video Analysis and Summarization
MethodsALIGN
