PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting
Hohyun Na, Seunghoo Hong, Simon S. Woo

TL;DR
PromptFlare introduces a novel adversarial defense mechanism for diffusion-based image inpainting that uses cross-attention decoys to neutralize malicious prompt-driven modifications, enhancing security and efficiency.
Contribution
The paper presents a new prompt-based adversarial protection method leveraging cross-attention decoys, addressing limitations of previous image-level approaches.
Findings
Achieves state-of-the-art defense performance on EditBench dataset.
Reduces computational overhead and GPU memory usage.
Provides robust protection against prompt-based image manipulation.
Abstract
The success of diffusion models has enabled effortless, high-quality image modifications that precisely align with users' intentions, thereby raising concerns about their potential misuse by malicious actors. Previous studies have attempted to mitigate such misuse through adversarial attacks. However, these approaches heavily rely on image-level inconsistencies, which pose fundamental limitations in addressing the influence of textual prompts. In this paper, we propose PromptFlare, a novel adversarial protection method designed to protect images from malicious modifications facilitated by diffusion-based inpainting models. Our approach leverages the cross-attention mechanism to exploit the intrinsic properties of prompt embeddings. Specifically, we identify and target shared token of prompts that is invariant and semantically uninformative, injecting adversarial noise to suppress the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
