Defenses in Adversarial Machine Learning: A Survey
Baoyuan Wu, Shaokui Wei, Mingli Zhu, Meixi Zheng, Zihao Zhu, Mingda, Zhang, Hongrui Chen, Danni Yuan, Li Liu, Qingshan Liu

TL;DR
This survey systematically reviews defense strategies against adversarial attacks in machine learning, categorizing methods across the entire ML system lifecycle to enhance understanding and inspire future robust defense development.
Contribution
It introduces a unified, lifecycle-based taxonomy for defense paradigms, facilitating comprehensive analysis and comparison of existing methods in adversarial ML defenses.
Findings
Provides a complete taxonomy of defense methods at five ML system stages
Analyzes connections and differences among defense paradigms
Facilitates understanding of defense mechanisms and future research directions
Abstract
Adversarial phenomenon has been widely observed in machine learning (ML) systems, especially in those using deep neural networks, describing that ML systems may produce inconsistent and incomprehensible predictions with humans at some particular cases. This phenomenon poses a serious security threat to the practical application of ML systems, and several advanced attack paradigms have been developed to explore it, mainly including backdoor attacks, weight attacks, and adversarial examples. For each individual attack paradigm, various defense paradigms have been developed to improve the model robustness against the corresponding attack paradigm. However, due to the independence and diversity of these defense paradigms, it is difficult to examine the overall robustness of an ML system against different kinds of attacks.This survey aims to build a systematic review of all existing defense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Electrostatic Discharge in Electronics
