An Optimal Transport Approach for Computing Adversarial Training Lower   Bounds in Multiclass Classification

Nicolas Garcia Trillos; Matt Jacobs; Jakwang Kim; Matthew Werenski

arXiv:2401.09191·cs.LG·January 18, 2024·1 cites

An Optimal Transport Approach for Computing Adversarial Training Lower Bounds in Multiclass Classification

Nicolas Garcia Trillos, Matt Jacobs, Jakwang Kim, Matthew Werenski

PDF

Open Access 1 Repo

TL;DR

This paper introduces computationally efficient algorithms based on optimal transport theory to estimate lower bounds on adversarial risk in multiclass classification, improving robustness analysis of neural networks.

Contribution

It leverages multimarginal optimal transport to develop tractable algorithms for lower bounds in adversarial training, using linear programming and Sinkhorn regularization.

Findings

01

Algorithms successfully compute lower bounds on adversarial risk.

02

Approach is validated on MNIST and CIFAR-10 datasets.

03

Truncation of class interactions reduces computational complexity.

Abstract

Despite the success of deep learning-based algorithms, it is widely known that neural networks may fail to be robust. A popular paradigm to enforce robustness is adversarial training (AT), however, this introduces many computational and theoretical difficulties. Recent works have developed a connection between AT in the multiclass classification setting and multimarginal optimal transport (MOT), unlocking a new set of tools to study this problem. In this paper, we leverage the MOT connection to propose computationally tractable numerical algorithms for computing universal lower bounds on the optimal adversarial risk and identifying optimal classifiers. We propose two main algorithms based on linear programming (LP) and entropic regularization (Sinkhorn). Our key insight is that one can harmlessly truncate the higher order interactions between classes, preventing the combinatorial run…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mattwerenski/adversarial-ot
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)

MethodsSparse Evolutionary Training