TL;DR
This paper analyzes adversarial risk in machine learning through the lens of optimal transport, deriving fundamental limits and optimal classifiers for various distributions and loss functions.
Contribution
It introduces a novel optimal transport framework to quantify and analyze adversarial risk, providing explicit optimal classifiers and risk bounds.
Findings
Optimal adversarial risk is characterized by an optimal transport cost.
Explicit optimal classifiers are derived for univariate distributions.
Fundamental limits on adversarial risk are computed for real datasets.
Abstract
Modern machine learning algorithms perform poorly on adversarially manipulated data. Adversarial risk quantifies the error of classifiers in adversarial settings; adversarial classifiers minimize adversarial risk. In this paper, we analyze adversarial risk and adversarial classifiers from an optimal transport perspective. We show that the optimal adversarial risk for binary classification with 0-1 loss is determined by an optimal transport cost between the probability distributions of the two classes. We develop optimal transport plans (probabilistic couplings) for univariate distributions such as the normal, the uniform, and the triangular distribution. We also derive optimal adversarial classifiers in these settings. Our analysis leads to algorithm-independent fundamental limits on adversarial risk, which we calculate for several real-world datasets. We extend our results to general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
