Interpolated Adversarial Training: Achieving Robust Neural Networks without Sacrificing Too Much Accuracy
Alex Lamb, Vikas Verma, Kenji Kawaguchi, Alexander Matyasko, Savya, Khosla, Juho Kannala, Yoshua Bengio

TL;DR
This paper introduces Interpolated Adversarial Training, a method that improves neural network robustness against adversarial attacks while maintaining high accuracy on unperturbed data, addressing a key challenge in deep learning.
Contribution
The paper proposes a novel interpolation-based adversarial training method that balances robustness and accuracy, with theoretical analysis supporting its effectiveness.
Findings
Retains robustness with only 6.45% standard error on CIFAR-10
Reduces the accuracy gap between robust and standard models significantly
Provides mathematical proof of the method's efficiency
Abstract
Adversarial robustness has become a central goal in deep learning, both in the theory and the practice. However, successful methods to improve the adversarial robustness (such as adversarial training) greatly hurt generalization performance on the unperturbed data. This could have a major impact on how the adversarial robustness affects real world systems (i.e. many may opt to forego robustness if it can improve accuracy on the unperturbed data). We propose Interpolated Adversarial Training, which employs recently proposed interpolation based training methods in the framework of adversarial training. On CIFAR-10, adversarial training increases the standard test error (when there is no adversary) from 4.43% to 12.32%, whereas with our Interpolated adversarial training we retain the adversarial robustness while achieving a standard test error of only 6.45%. With our technique, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
