Lower Bounds on Adversarial Robustness from Optimal Transport
Arjun Nitin Bhagoji, Daniel Cullina, Prateek Mittal

TL;DR
This paper uses optimal transport theory to establish fundamental lower bounds on adversarial robustness in classification, providing insights into the limits of classifier performance under adversarial perturbations.
Contribution
It introduces a novel framework linking optimal transport costs to minimum achievable adversarial loss, offering new bounds and analysis for robust classification.
Findings
Optimal transport cost bounds the minimum adversarial loss.
Linear classifiers are optimal under certain conditions.
Sample complexity results extend previous bounds.
Abstract
While progress has been made in understanding the robustness of machine learning classifiers to test-time adversaries (evasion attacks), fundamental questions remain unresolved. In this paper, we use optimal transport to characterize the minimum possible loss in an adversarial classification scenario. In this setting, an adversary receives a random labeled example from one of two classes, perturbs the example subject to a neighborhood constraint, and presents the modified example to the classifier. We define an appropriate cost function such that the minimum transportation cost between the distributions of the two classes determines the minimum loss for any classifier. When the classifier comes from a restricted hypothesis class, the optimal transportation cost provides a lower bound. We apply our framework to the case of Gaussian data with norm-bounded adversaries and explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Fault Detection and Control Systems
