Certified Robustness against Sparse Adversarial Perturbations via Data Localization
Ambar Pal, Ren\'e Vidal, Jeremias Sulam

TL;DR
This paper extends the theory of data localization for adversarial robustness from $\,ell_2$-bounded to $\,ell_0$-bounded perturbations, introducing a simple, effective classifier called Box-NN that improves certified robustness against sparse attacks.
Contribution
It provides necessary and sufficient conditions for $\,ell_0$-robust classifiers and proposes Box-NN, a simple classifier that enhances certified robustness against sparse adversarial attacks.
Findings
Box-NN outperforms existing methods on MNIST and Fashion-MNIST.
Theoretical conditions for $\,ell_0$-robustness are established.
Improved certified robustness against sparse attacks.
Abstract
Recent work in adversarial robustness suggests that natural data distributions are localized, i.e., they place high probability in small volume regions of the input space, and that this property can be utilized for designing classifiers with improved robustness guarantees for -bounded perturbations. Yet, it is still unclear if this observation holds true for more general metrics. In this work, we extend this theory to -bounded adversarial perturbations, where the attacker can modify a few pixels of the image but is unrestricted in the magnitude of perturbation, and we show necessary and sufficient conditions for the existence of -robust classifiers. Theoretical certification approaches in this regime essentially employ voting over a large ensemble of classifiers. Such procedures are combinatorial and expensive or require complicated certification techniques. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Anomaly Detection Techniques and Applications
