Adversarial Robustness of Deep Neural Networks: A Survey from a Formal Verification Perspective
Mark Huasong Meng, Guangdong Bai, Sin Gee Teo, Zhe Hou, Yan Xiao, Yun, Lin, Jin Song Dong

TL;DR
This survey reviews formal verification methods for assessing the adversarial robustness of neural networks, highlighting approaches, classifications, and open challenges in ensuring trustworthy AI security.
Contribution
It provides a comprehensive taxonomy and analysis of formal verification techniques for neural network robustness, integrating insights across multiple domains.
Findings
Classifies verification techniques based on property specification and reasoning strategies
Demonstrates representative verification methods on sample models
Identifies open research questions in adversarial robustness verification
Abstract
Neural networks have been widely applied in security applications such as spam and phishing detection, intrusion prevention, and malware detection. This black-box method, however, often has uncertainty and poor explainability in applications. Furthermore, neural networks themselves are often vulnerable to adversarial attacks. For those reasons, there is a high demand for trustworthy and rigorous methods to verify the robustness of neural network models. Adversarial robustness, which concerns the reliability of a neural network when dealing with maliciously manipulated inputs, is one of the hottest topics in security and machine learning. In this work, we survey existing literature in adversarial robustness verification for neural networks and collect 39 diversified research works across machine learning, security, and software engineering domains. We systematically analyze their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
