The Curse of Concentration in Robust Learning: Evasion and Poisoning Attacks from Concentration of Measure
Saeed Mahloujifar, Dimitrios I. Diochnos, Mohammad Mahmoody

TL;DR
This paper connects the vulnerability of classifiers to adversarial attacks with the mathematical phenomenon of measure concentration, showing that high-dimensional, concentrated spaces inherently lead to adversarial examples and poisoning vulnerabilities.
Contribution
It provides a theoretical framework linking measure concentration to adversarial and poisoning attacks, extending existing results and introducing new attack bounds for various distributions.
Findings
Adversarial vulnerability is inherent in concentrated metric spaces.
New attack bounds of O(√n) for Levy families and product distributions.
Poisoning attacks can significantly increase failure probability with few data modifications.
Abstract
Many modern machine learning classifiers are shown to be vulnerable to adversarial perturbations of the instances. Despite a massive amount of work focusing on making classifiers robust, the task seems quite challenging. In this work, through a theoretical study, we investigate the adversarial risk and robustness of classifiers and draw a connection to the well-known phenomenon of concentration of measure in metric measure spaces. We show that if the metric probability space of the test instance is concentrated, any classifier with some initial constant error is inherently vulnerable to adversarial perturbations. One class of concentrated metric probability spaces are the so-called Levy families that include many natural distributions. In this special case, our attacks only need to perturb the test instance by at most to make it misclassified, where is the data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
