Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models
Yihao Huang, Felix Juefei-Xu, Qing Guo, Jie Zhang, Yutong Wu, Ming Hu,, Tianlin Li, Geguang Pu, Yang Liu

TL;DR
This paper uncovers a backdoor vulnerability in text-to-image diffusion models' personalization methods, demonstrating how attackers can exploit these techniques for efficient, stealthy attacks with minimal examples, raising security concerns.
Contribution
The study identifies a zero-day backdoor vulnerability in personalization methods like Textual Inversion and DreamBooth, proposing effective attack strategies and analyzing their impact.
Findings
Nouveau-token backdoor attack is highly effective and stealthy.
Personalization methods can be exploited for precise backdoor attacks.
The attack outperforms legacy-token methods in effectiveness and stealth.
Abstract
Although recent personalization methods have democratized high-resolution image synthesis by enabling swift concept acquisition with minimal examples and lightweight computation, they also present an exploitable avenue for high accessible backdoor attacks. This paper investigates a critical and unexplored aspect of text-to-image (T2I) diffusion models - their potential vulnerability to backdoor attacks via personalization. Our study focuses on a zero-day backdoor vulnerability prevalent in two families of personalization methods, epitomized by Textual Inversion and DreamBooth.Compared to traditional backdoor attacks, our proposed method can facilitate more precise, efficient, and easily accessible attacks with a lower barrier to entry. We provide a comprehensive review of personalization in T2I diffusion models, highlighting the operation and exploitation potential of this backdoor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsChaos-based Image/Signal Encryption
MethodsDiffusion
