Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion   Models

Changjiang Li; Ren Pang; Bochuan Cao; Jinghui Chen; Fenglong Ma,; Shouling Ji; Ting Wang

arXiv:2406.09669·cs.CR·June 17, 2024·1 cites

Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models

Changjiang Li, Ren Pang, Bochuan Cao, Jinghui Chen, Fenglong Ma,, Shouling Ji, Ting Wang

PDF

Open Access

TL;DR

This paper reveals that diffusion models used for security purposes are vulnerable to backdoor attacks, which can undermine their effectiveness in defending against adversarial threats.

Contribution

It introduces DIFF2, a novel backdoor attack on diffusion models, demonstrating its impact on security tasks like adversarial purification and robustness certification.

Findings

01

DIFF2 significantly reduces purification effectiveness.

02

Backdoored models show decreased certified robustness.

03

Vulnerabilities pose risks to diffusion-based defenses.

Abstract

Thanks to their remarkable denoising capabilities, diffusion models are increasingly being employed as defensive tools to reinforce the security of other models, notably in purifying adversarial examples and certifying adversarial robustness. However, the security risks of these practices themselves remain largely unexplored, which is highly concerning. To bridge this gap, this work investigates the vulnerabilities of security-enhancing diffusion models. Specifically, we demonstrate that these models are highly susceptible to DIFF2, a simple yet effective backdoor attack, which substantially diminishes the security assurance provided by such models. Essentially, DIFF2 achieves this by integrating a malicious diffusion-sampling process into the diffusion model, guiding inputs embedded with specific triggers toward an adversary-defined distribution while preserving the normal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection