A Survey of Trojan Attacks and Defenses to Deep Neural Networks
Lingxin Jin, Xianyu Wen, Wei Jiang, Jinyu Zhan

TL;DR
This survey reviews the evolution, attack methods, and defense strategies related to Trojan attacks on deep neural networks, emphasizing their real-world risks and the need for effective countermeasures.
Contribution
It provides a comprehensive overview and comparative analysis of Trojan attack techniques and defenses, highlighting practical implications and future research directions.
Findings
Trojan attacks can be effectively embedded in DNNs, posing significant security threats.
Current defenses vary in effectiveness and practicality.
Real-world deployment of Trojan attacks remains a critical concern.
Abstract
Deep Neural Networks (DNNs) have found extensive applications in safety-critical artificial intelligence systems, such as autonomous driving and facial recognition systems. However, recent research has revealed their susceptibility to Neural Network Trojans (NN Trojans) maliciously injected by adversaries. This vulnerability arises due to the intricate architecture and opacity of DNNs, resulting in numerous redundant neurons embedded within the models. Adversaries exploit these vulnerabilities to conceal malicious Trojans within DNNs, thereby causing erroneous outputs and posing substantial threats to the efficacy of DNN-based applications. This article presents a comprehensive survey of Trojan attacks against DNNs and the countermeasure methods employed to mitigate them. Initially, we trace the evolution of the concept from traditional Trojans to NN Trojans, highlighting the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security
MethodsGravity
