Adversarial Robustness of Deep Neural Networks: A Survey from a Formal   Verification Perspective

Mark Huasong Meng; Guangdong Bai; Sin Gee Teo; Zhe Hou; Yan Xiao; Yun; Lin; Jin Song Dong

arXiv:2206.12227·cs.CR·October 12, 2022

Adversarial Robustness of Deep Neural Networks: A Survey from a Formal Verification Perspective

Mark Huasong Meng, Guangdong Bai, Sin Gee Teo, Zhe Hou, Yan Xiao, Yun, Lin, Jin Song Dong

PDF

TL;DR

This survey reviews formal verification methods for assessing the adversarial robustness of neural networks, highlighting approaches, classifications, and open challenges in ensuring trustworthy AI security.

Contribution

It provides a comprehensive taxonomy and analysis of formal verification techniques for neural network robustness, integrating insights across multiple domains.

Findings

01

Classifies verification techniques based on property specification and reasoning strategies

02

Demonstrates representative verification methods on sample models

03

Identifies open research questions in adversarial robustness verification

Abstract

Neural networks have been widely applied in security applications such as spam and phishing detection, intrusion prevention, and malware detection. This black-box method, however, often has uncertainty and poor explainability in applications. Furthermore, neural networks themselves are often vulnerable to adversarial attacks. For those reasons, there is a high demand for trustworthy and rigorous methods to verify the robustness of neural network models. Adversarial robustness, which concerns the reliability of a neural network when dealing with maliciously manipulated inputs, is one of the hottest topics in security and machine learning. In this work, we survey existing literature in adversarial robustness verification for neural networks and collect 39 diversified research works across machine learning, security, and software engineering domains. We systematically analyze their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.