Adversarial robustness for latent models: Revisiting the robust-standard   accuracies tradeoff

Adel Javanmard; Mohammad Mehrabi

arXiv:2110.11950·cs.LG·April 4, 2022

Adversarial robustness for latent models: Revisiting the robust-standard accuracies tradeoff

Adel Javanmard, Mohammad Mehrabi

PDF

Open Access 1 Repo

TL;DR

This paper investigates the tradeoff between standard and robust accuracy in adversarial training, showing that low-dimensional data structures can mitigate this tradeoff and enable models to perform well on both metrics.

Contribution

The paper demonstrates that low-dimensional manifold structures in data can reduce the robustness-accuracy tradeoff, providing theoretical insights and empirical validation.

Findings

01

Low-dimensional data structures mitigate the robustness-accuracy tradeoff.

02

Models on low-dimensional manifolds achieve near-optimal standard and robust accuracy.

03

Numerical experiments confirm the theory on MNIST with Mixture of Factor Analyzers.

Abstract

Over the past few years, several adversarial training methods have been proposed to improve the robustness of machine learning models against adversarial perturbations in the input. Despite remarkable progress in this regard, adversarial training is often observed to drop the standard test accuracy. This phenomenon has intrigued the research community to investigate the potential tradeoff between standard accuracy (a.k.a generalization) and robust accuracy (a.k.a robust generalization) as two performance measures. In this paper, we revisit this tradeoff for latent models and argue that this tradeoff is mitigated when the data enjoys a low-dimensional structure. In particular, we consider binary classification under two data generative models, namely Gaussian mixture model and generalized linear model, where the features data lie on a low-dimensional manifold. We develop a theory to show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cleverhans-lab/cleverhans
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications

MethodsTest