Towards Rapid and Robust Adversarial Training with One-Step Attacks
Leo Schwinn, Ren\'e Raab, Bj\"orn Eskofier

TL;DR
This paper introduces a faster adversarial training method using FGSM combined with noise injection and a learnable regularization layer, achieving robustness comparable or superior to more computationally intensive PGD-based methods.
Contribution
The paper proposes a novel combination of noise injection and a learnable regularization layer to enable effective one-step adversarial training, reducing computational costs.
Findings
FGSM-based training with noise injection matches PGD robustness.
Adding PNIL improves gradient obfuscation prevention.
The combined method outperforms PGD-based adversarial training.
Abstract
Adversarial training is the most successful empirical method for increasing the robustness of neural networks against adversarial attacks. However, the most effective approaches, like training with Projected Gradient Descent (PGD) are accompanied by high computational complexity. In this paper, we present two ideas that, in combination, enable adversarial training with the computationally less expensive Fast Gradient Sign Method (FGSM). First, we add uniform noise to the initial data point of the FGSM attack, which creates a wider variety of adversaries, thus prohibiting overfitting to one particular perturbation bound. Further, we add a learnable regularization step prior to the neural network, which we call Pixelwise Noise Injection Layer (PNIL). Inputs propagated trough the PNIL are resampled from a learned Gaussian distribution. The regularization induced by the PNIL prevents the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
