CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for   Adversarial Defense

Mingkun Zhang; Keping Bi; Wei Chen; Quanrun Chen; Jiafeng Guo; Xueqi; Cheng

arXiv:2410.23091·cs.CV·February 26, 2025

CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense

Mingkun Zhang, Keping Bi, Wei Chen, Quanrun Chen, Jiafeng Guo, Xueqi, Cheng

PDF

Open Access 1 Repo 1 Video

TL;DR

CausalDiff introduces a causality-inspired diffusion model that disentangles essential label-causative factors from non-causative factors to improve neural network robustness against unseen adversarial attacks.

Contribution

The paper proposes a novel causal diffusion model with a causal information bottleneck for disentangling factors, enhancing adversarial defense beyond existing methods.

Findings

01

Significantly outperforms state-of-the-art defenses on unseen attacks.

02

Achieves over 86% robustness on CIFAR-10.

03

Demonstrates effectiveness across multiple datasets.

Abstract

Despite ongoing efforts to defend neural classifiers from adversarial attacks, they remain vulnerable, especially to unseen attacks. In contrast, humans are difficult to be cheated by subtle manipulations, since we make judgments only based on essential factors. Inspired by this observation, we attempt to model label generation with essential label-causative factors and incorporate label-non-causative factors to assist data generation. For an adversarial example, we aim to discriminate the perturbations as non-causative factors and make predictions only based on the label-causative factors. Concretely, we propose a casual diffusion model (CausalDiff) that adapts diffusion models for conditional data generation and disentangles the two types of casual factors by learning towards a novel casual information bottleneck objective. Empirically, CausalDiff has significantly outperformed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cas-aisafetybasicresearchgroup/causaldiff
pytorchOfficial

Videos

CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsDiffusion