Where and How to Attack? A Causality-Inspired Recipe for Generating   Counterfactual Adversarial Examples

Ruichu Cai; Yuxuan Zhu; Jie Qiao; Zefeng Liang; Furui Liu; Zhifeng Hao

arXiv:2312.13628·cs.LG·January 29, 2024·1 cites

Where and How to Attack? A Causality-Inspired Recipe for Generating Counterfactual Adversarial Examples

Ruichu Cai, Yuxuan Zhu, Jie Qiao, Zefeng Liang, Furui Liu, Zhifeng Hao

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causality-inspired framework called CADE for generating realistic counterfactual adversarial examples by considering the causal data generation process, improving attack relevance and effectiveness.

Contribution

It presents a novel causality-based approach to identify where and how to attack neural networks with more realistic adversarial examples, addressing limitations of traditional methods.

Findings

01

CADE achieves competitive attack success across various scenarios.

02

The causality-based approach improves the realism of adversarial examples.

03

Theoretical analysis clarifies the vulnerability sources of DNNs.

Abstract

Deep neural networks (DNNs) have been demonstrated to be vulnerable to well-crafted \emph{adversarial examples}, which are generated through either well-conceived $L_{p}$ -norm restricted or unrestricted attacks. Nevertheless, the majority of those approaches assume that adversaries can modify any features as they wish, and neglect the causal generating process of the data, which is unreasonable and unpractical. For instance, a modification in income would inevitably impact features like the debt-to-income ratio within a banking system. By considering the underappreciated causal generating process, first, we pinpoint the source of the vulnerability of DNNs via the lens of causality, then give theoretical results to answer \emph{where to attack}. Second, considering the consequences of the attack interventions on the current state of the examples to generate more realistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EthanChu7/CADE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)