On the Minimal Adversarial Perturbation for Deep Neural Networks with Provable Estimation Error
Fabio Brau, Giulio Rossolini, Alessandro Biondi, Giorgio Buttazzo

TL;DR
This paper introduces two lightweight methods for estimating the minimal adversarial perturbation in deep neural networks, providing provable bounds on the estimation error and enhancing robustness guarantees.
Contribution
It proposes novel strategies that enable error estimation of the minimal adversarial perturbation, addressing a gap in provable robustness analysis for DNNs.
Findings
The methods approximate the theoretical minimal perturbation near decision boundaries.
Experimental results validate the accuracy of the error bounds.
The approach offers provable robustness guarantees against adversarial attacks.
Abstract
Although Deep Neural Networks (DNNs) have shown incredible performance in perceptive and control tasks, several trustworthy issues are still open. One of the most discussed topics is the existence of adversarial perturbations, which has opened an interesting research line on provable techniques capable of quantifying the robustness of a given input. In this regard, the Euclidean distance of the input from the classification boundary denotes a well-proved robustness assessment as the minimal affordable adversarial perturbation. Unfortunately, computing such a distance is highly complex due the non-convex nature of NNs. Despite several methods have been proposed to address this issue, to the best of our knowledge, no provable results have been presented to estimate and bound the error committed. This paper addresses this issue by proposing two lightweight strategies to find the minimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
