Backdoor Scanning for Deep Neural Networks through K-Arm Optimization
Guangyu Shen, Yingqi Liu, Guanhong Tao, Shengwei An, Qiuling Xu,, Siyuan Cheng, Shiqing Ma, Xiangyu Zhang

TL;DR
This paper introduces a K-Arm optimization method inspired by Multi-Arm Bandit algorithms to efficiently detect backdoor attacks in deep neural networks, especially those with many classes, improving accuracy and reducing scanning time.
Contribution
The paper presents a novel K-Arm optimization approach for backdoor detection that reduces complexity and enhances detection accuracy compared to existing methods.
Findings
Achieved top leaderboard performance on over 4000 models in TrojAI.
Outperformed three state-of-the-art techniques in accuracy.
Reduced scanning time significantly.
Abstract
Back-door attack poses a severe threat to deep learning systems. It injects hidden malicious behaviors to a model such that any input stamped with a special pattern can trigger such behaviors. Detecting back-door is hence of pressing need. Many existing defense techniques use optimization to generate the smallest input pattern that forces the model to misclassify a set of benign inputs injected with the pattern to a target label. However, the complexity is quadratic to the number of class labels such that they can hardly handle models with many classes. Inspired by Multi-Arm Bandit in Reinforcement Learning, we propose a K-Arm optimization method for backdoor detection. By iteratively and stochastically selecting the most promising labels for optimization with the guidance of an objective function, we substantially reduce the complexity, allowing to handle models with many classes.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Anomaly Detection Techniques and Applications
