Progressive Backdoor Erasing via connecting Backdoor and Adversarial Attacks
Bingxu Mu, Zhenxing Niu, Le Wang, Xue Wang, Rong Jin, Gang Hua

TL;DR
This paper reveals a connection between backdoor and adversarial attacks in neural networks and introduces a novel Progressive Backdoor Erasing method that effectively removes backdoors without needing a clean dataset.
Contribution
The paper proposes a new link between backdoor and adversarial attacks and introduces PBE, a backdoor defense method that does not require a clean dataset for effective backdoor removal.
Findings
PBE effectively erases backdoors against 5 state-of-the-art attacks.
PBE maintains model performance on clean samples.
PBE outperforms existing backdoor defense methods.
Abstract
Deep neural networks (DNNs) are known to be vulnerable to both backdoor attacks as well as adversarial attacks. In the literature, these two types of attacks are commonly treated as distinct problems and solved separately, since they belong to training-time and inference-time attacks respectively. However, in this paper we find an intriguing connection between them: for a model planted with backdoors, we observe that its adversarial examples have similar behaviors as its triggered images, i.e., both activate the same subset of DNN neurons. It indicates that planting a backdoor into a model will significantly affect the model's adversarial examples. Based on these observations, a novel Progressive Backdoor Erasing (PBE) algorithm is proposed to progressively purify the infected model by leveraging untargeted adversarial attacks. Different from previous backdoor defense methods, one…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Malware Detection Techniques
