Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models
Jingyao Xu, Yuetong Lu, Yandong Li, Siyang Lu, Dongdong Wang, Xiang, Wei

TL;DR
This paper introduces CAAT, a fast and effective adversarial attack method that subtly perturbs images to significantly disrupt the output of diffusion models by exploiting cross-attention layer sensitivities.
Contribution
The paper presents CAAT, a novel, training-free attack method that efficiently fools diffusion models by targeting cross-attention layers with subtle image perturbations.
Findings
CAAT outperforms baseline attacks in effectiveness and speed.
Subtle image perturbations can significantly alter diffusion model outputs.
CAAT is compatible with various diffusion models.
Abstract
Diffusion models (DMs) embark a new era of generative modeling and offer more opportunities for efficient generating high-quality and realistic data samples. However, their widespread use has also brought forth new challenges in model security, which motivates the creation of more effective adversarial attackers on DMs to understand its vulnerability. We propose CAAT, a simple but generic and efficient approach that does not require costly training to effectively fool latent diffusion models (LDMs). The approach is based on the observation that cross-attention layers exhibits higher sensitivity to gradient change, allowing for leveraging subtle perturbations on published images to significantly corrupt the generated images. We show that a subtle perturbation on an image can significantly impact the cross-attention layers, thus changing the mapping between text and image during the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic processes and financial applications · Advanced Materials Characterization Techniques
MethodsDiffusion
