Interpolated Adversarial Training: Achieving Robust Neural Networks   without Sacrificing Too Much Accuracy

Alex Lamb; Vikas Verma; Kenji Kawaguchi; Alexander Matyasko; Savya; Khosla; Juho Kannala; Yoshua Bengio

arXiv:1906.06784·stat.ML·October 20, 2022·1 cites

Interpolated Adversarial Training: Achieving Robust Neural Networks without Sacrificing Too Much Accuracy

Alex Lamb, Vikas Verma, Kenji Kawaguchi, Alexander Matyasko, Savya, Khosla, Juho Kannala, Yoshua Bengio

PDF

Open Access 3 Repos

TL;DR

This paper introduces Interpolated Adversarial Training, a method that improves neural network robustness against adversarial attacks while maintaining high accuracy on unperturbed data, addressing a key challenge in deep learning.

Contribution

The paper proposes a novel interpolation-based adversarial training method that balances robustness and accuracy, with theoretical analysis supporting its effectiveness.

Findings

01

Retains robustness with only 6.45% standard error on CIFAR-10

02

Reduces the accuracy gap between robust and standard models significantly

03

Provides mathematical proof of the method's efficiency

Abstract

Adversarial robustness has become a central goal in deep learning, both in the theory and the practice. However, successful methods to improve the adversarial robustness (such as adversarial training) greatly hurt generalization performance on the unperturbed data. This could have a major impact on how the adversarial robustness affects real world systems (i.e. many may opt to forego robustness if it can improve accuracy on the unperturbed data). We propose Interpolated Adversarial Training, which employs recently proposed interpolation based training methods in the framework of adversarial training. On CIFAR-10, adversarial training increases the standard test error (when there is no adversary) from 4.43% to 12.32%, whereas with our Interpolated adversarial training we retain the adversarial robustness while achieving a standard test error of only 6.45%. With our technique, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications