TL;DR
This paper introduces a game-based method for approximate verification of deep neural networks, providing provable guarantees on robustness measures and demonstrating effectiveness in safety-critical scenarios.
Contribution
It formulates robustness verification as two-player games with provable approximation guarantees, employing an anytime approach and search algorithms for practical evaluation.
Findings
Competitive performance against existing adversarial crafting algorithms
Effective evaluation of neural network robustness in safety-critical applications
Provable bounds on approximation errors in robustness measures
Abstract
Despite the improved accuracy of deep neural networks, the discovery of adversarial examples has raised serious safety concerns. In this paper, we study two variants of pointwise robustness, the maximum safe radius problem, which for a given input sample computes the minimum distance to an adversarial example, and the feature robustness problem, which aims to quantify the robustness of individual features to adversarial perturbations. We demonstrate that, under the assumption of Lipschitz continuity, both problems can be approximated using finite optimisation by discretising the input space, and the approximation has provable guarantees, i.e., the error is bounded. We then show that the resulting optimisation problems can be reduced to the solution of two-player turn-based games, where the first player selects features and the second perturbs the image within the feature. While the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
