CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path   Fuzzing

Haibo Jin; Ruoxi Chen; Jinyin Chen; Haibin Zheng; Yang Zhang; and; Haohan Wang

arXiv:2112.13064·cs.CR·July 18, 2024·1 cites

CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Haibo Jin, Ruoxi Chen, Jinyin Chen, Haibin Zheng, Yang Zhang, and, Haohan Wang

PDF

Open Access

TL;DR

CatchBackdoor is a novel backdoor detection method that identifies trojaned neural paths by differential fuzzing, improving detection accuracy especially for stealthy attacks without relying on benign examples.

Contribution

It introduces a new approach leveraging critical neuron paths and differential fuzzing to detect backdoors, overcoming limitations of existing methods.

Findings

01

Outperforms state-of-the-art detection methods.

02

Effective against stealthy and small triggers.

03

Does not require benign examples for detection.

Abstract

The success of deep neural networks (DNNs) in real-world applications has benefited from abundant pre-trained models. However, the backdoored pre-trained models can pose a significant trojan threat to the deployment of downstream DNNs. Numerous backdoor detection methods have been proposed but are limited to two aspects: (1) high sensitivity on trigger size, especially on stealthy attacks (i.e., blending attacks and defense adaptive attacks); (2) rely heavily on benign examples for reverse engineering. To address these challenges, we empirically observed that trojaned behaviors triggered by various trojan attacks can be attributed to the trojan path, composed of top- $k$ critical neurons with more significant contributions to model prediction changes. Motivated by it, we propose CatchBackdoor, a detection method against trojan attacks. Based on the close connection between trojaned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications