An Overview of Backdoor Attacks Against Deep Neural Networks and Possible Defences
Wei Guo, Benedetta Tondi, Mauro Barni

TL;DR
This paper reviews backdoor attacks on deep neural networks, classifying them based on attacker control and defense capabilities, highlighting their strengths, weaknesses, and implications for AI security.
Contribution
It provides a comprehensive classification and analysis of backdoor attack types and defenses, summarizing current research and identifying gaps in security measures.
Findings
Backdoor attacks can be classified by attacker control and defense verification capabilities.
Existing defenses vary in effectiveness depending on attack type and scenario.
The overview highlights strengths and weaknesses of current backdoor mitigation strategies.
Abstract
Together with impressive advances touching every aspect of our society, AI technology based on Deep Neural Networks (DNN) is bringing increasing security concerns. While attacks operating at test time have monopolised the initial attention of researchers, backdoor attacks, exploiting the possibility of corrupting DNN models by interfering with the training process, represents a further serious threat undermining the dependability of AI techniques. In a backdoor attack, the attacker corrupts the training data so to induce an erroneous behaviour at test time. Test time errors, however, are activated only in the presence of a triggering event corresponding to a properly crafted input sample. In this way, the corrupted network continues to work as expected for regular inputs, and the malicious behaviour occurs only when the attacker decides to activate the backdoor hidden within the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Network Security and Intrusion Detection · Advanced Malware Detection Techniques
