Personalization as a Shortcut for Few-Shot Backdoor Attack against   Text-to-Image Diffusion Models

Yihao Huang; Felix Juefei-Xu; Qing Guo; Jie Zhang; Yutong Wu; Ming Hu,; Tianlin Li; Geguang Pu; Yang Liu

arXiv:2305.10701·cs.CV·December 21, 2023·1 cites

Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models

Yihao Huang, Felix Juefei-Xu, Qing Guo, Jie Zhang, Yutong Wu, Ming Hu,, Tianlin Li, Geguang Pu, Yang Liu

PDF

Open Access

TL;DR

This paper uncovers a backdoor vulnerability in text-to-image diffusion models' personalization methods, demonstrating how attackers can exploit these techniques for efficient, stealthy attacks with minimal examples, raising security concerns.

Contribution

The study identifies a zero-day backdoor vulnerability in personalization methods like Textual Inversion and DreamBooth, proposing effective attack strategies and analyzing their impact.

Findings

01

Nouveau-token backdoor attack is highly effective and stealthy.

02

Personalization methods can be exploited for precise backdoor attacks.

03

The attack outperforms legacy-token methods in effectiveness and stealth.

Abstract

Although recent personalization methods have democratized high-resolution image synthesis by enabling swift concept acquisition with minimal examples and lightweight computation, they also present an exploitable avenue for high accessible backdoor attacks. This paper investigates a critical and unexplored aspect of text-to-image (T2I) diffusion models - their potential vulnerability to backdoor attacks via personalization. Our study focuses on a zero-day backdoor vulnerability prevalent in two families of personalization methods, epitomized by Textual Inversion and DreamBooth.Compared to traditional backdoor attacks, our proposed method can facilitate more precise, efficient, and easily accessible attacks with a lower barrier to entry. We provide a comprehensive review of personalization in T2I diffusion models, highlighting the operation and exploitation potential of this backdoor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsChaos-based Image/Signal Encryption

MethodsDiffusion