Optimal Transport Classifier: Defending Against Adversarial Attacks by Regularized Deep Embedding
Yao Li, Martin Renqiang Min, Wenchao Yu, Cho-Jui Hsieh, Thomas C.M., Lee, and Erik Kruus

TL;DR
This paper introduces the OT-Classifier, a novel framework that uses optimal transport to embed images into a low-dimensional space, significantly enhancing robustness against adversarial attacks.
Contribution
It proposes a new regularized embedding approach based on optimal transport theory to improve neural network robustness against adversarial examples.
Findings
Achieves state-of-the-art performance against strong adversarial attacks
Demonstrates robustness improvements on benchmark datasets
Validates the effectiveness of low-dimensional embedding in defending attacks
Abstract
Recent studies have demonstrated the vulnerability of deep convolutional neural networks against adversarial examples. Inspired by the observation that the intrinsic dimension of image data is much smaller than its pixel space dimension and the vulnerability of neural networks grows with the input dimension, we propose to embed high-dimensional input images into a low-dimensional space to perform classification. However, arbitrarily projecting the input images to a low-dimensional space without regularization will not improve the robustness of deep neural networks. Leveraging optimal transport theory, we propose a new framework, Optimal Transport Classifier (OT-Classifier), and derive an objective that minimizes the discrepancy between the distribution of the true label and the distribution of the OT-Classifier output. Experimental results on several benchmark datasets show that, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
