DiffBreak: Is Diffusion-Based Purification Robust?

Andre Kassis; Urs Hengartner; Yaoliang Yu

arXiv:2411.16598·cs.CR·February 11, 2026

DiffBreak: Is Diffusion-Based Purification Robust?

Andre Kassis, Urs Hengartner, Yaoliang Yu

PDF

Open Access 1 Repo 1 Video

TL;DR

DiffBreak critically evaluates diffusion-based purification for adversarial defense, revealing fundamental flaws and proposing more reliable evaluation methods that challenge its robustness claims.

Contribution

The paper introduces DiffBreak, a toolkit for proper gradient analysis of DBP, and proposes a statistically sound majority-vote scheme to improve robustness evaluation.

Findings

01

Gradient attacks target the diffusion model, not the classifier.

02

Single purification testing is invalid for robustness assessment.

03

Majority voting offers partial robustness improvement.

Abstract

Diffusion-based purification (DBP) has become a cornerstone defense against adversarial examples (AEs), regarded as robust due to its use of diffusion models (DMs) that project AEs onto the natural data manifold. We refute this core claim, theoretically proving that gradient-based attacks effectively target the DM rather than the classifier, causing DBP's outputs to align with adversarial distributions. This prompts a reassessment of DBP's robustness, attributing it to two critical flaws: incorrect gradients and inappropriate evaluation protocols that test only a single random purification of the AE. We show that with proper accounting for stochasticity and resubmission risk, DBP collapses. To support this, we introduce DiffBreak, the first reliable toolkit for differentiation through DBP, eliminating gradient flaws that previously further inflated robustness estimates. We also analyze…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andrekassis/DiffBreak
pytorchOfficial

Videos

DiffBreak: Is Diffusion-Based Purification Robust?· slideslive

Taxonomy

TopicsAdvancements in Semiconductor Devices and Circuit Design · Network Security and Intrusion Detection

MethodsAutoencoders · ALIGN · Diffusion · Lib