Towards A Critical Evaluation of Robustness for Deep Learning Backdoor Countermeasures
Huming Qiu, Hua Ma, Zhi Zhang, Alsharif Abuadbba, Wei Kang, Anmin Fu,, Yansong Gao

TL;DR
This paper critically evaluates the robustness of existing deep learning backdoor countermeasures, revealing their vulnerabilities and non-robust cases through formal proofs and empirical analysis, emphasizing the need for thorough robustness assessment.
Contribution
It provides the first systematic analysis of the robustness of three influential backdoor detection methods, uncovering their non-robust scenarios independent of adaptive attacks.
Findings
Neural Cleanse, ABS, and MNTD have inherent non-robust cases.
Varying tasks, models, datasets, and hyperparameters can bypass these defenses.
Recent MNTD is particularly vulnerable in certain non-robust scenarios.
Abstract
Since Deep Learning (DL) backdoor attacks have been revealed as one of the most insidious adversarial attacks, a number of countermeasures have been developed with certain assumptions defined in their respective threat models. However, the robustness of these countermeasures is inadvertently ignored, which can introduce severe consequences, e.g., a countermeasure can be misused and result in a false implication of backdoor detection. For the first time, we critically examine the robustness of existing backdoor countermeasures with an initial focus on three influential model-inspection ones that are Neural Cleanse (S&P'19), ABS (CCS'19), and MNTD (S&P'21). Although the three countermeasures claim that they work well under their respective threat models, they have inherent unexplored non-robust cases depending on factors such as given tasks, model architectures, datasets, and defense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Anomaly Detection Techniques and Applications
