Evolutionary Trigger Detection and Lightweight Model Repair Based Backdoor Defense
Qi Zhou, Zipeng Ye, Yubo Tang, Wenjian Luo, Yuhui Shi, and Yan Jia

TL;DR
This paper introduces an efficient backdoor defense for deep neural networks using evolutionary trigger detection and lightweight model repair, effectively identifying and mitigating backdoors even with limited resources.
Contribution
It proposes CETF, an evolutionary algorithm-based trigger detection method, and combines it with lightweight unlearning techniques for effective backdoor mitigation.
Findings
CETF accurately detects triggers in various attack scenarios.
The combined approach effectively repairs backdoored models.
The method is practical and stable across different conditions.
Abstract
Deep Neural Networks (DNNs) have been widely used in many areas such as autonomous driving and face recognition. However, DNN model is fragile to backdoor attack. A backdoor in the DNN model can be activated by a poisoned input with trigger and leads to wrong prediction, which causes serious security issues in applications. It is challenging for current defenses to eliminate the backdoor effectively with limited computing resources, especially when the sizes and numbers of the triggers are variable as in the physical world. We propose an efficient backdoor defense based on evolutionary trigger detection and lightweight model repair. In the first phase of our method, CAM-focus Evolutionary Trigger Filter (CETF) is proposed for trigger detection. CETF is an effective sample-preprocessing based method with the evolutionary algorithm, and our experimental results show that CETF not only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Network Security and Intrusion Detection
MethodsBatch Normalization
