Make Some Noise: Reliable and Efficient Single-Step Adversarial Training

Pau de Jorge; Adel Bibi; Riccardo Volpi; Amartya Sanyal; Philip H. S.; Torr; Gr\'egory Rogez; Puneet K. Dokania

arXiv:2202.01181·cs.LG·October 19, 2022·31 cites

Make Some Noise: Reliable and Efficient Single-Step Adversarial Training

Pau de Jorge, Adel Bibi, Riccardo Volpi, Amartya Sanyal, Philip H. S., Torr, Gr\'egory Rogez, Puneet K. Dokania

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Noise-FGSM, a simple yet effective single-step adversarial training method that prevents catastrophic overfitting by using stronger noise without clipping, matching or surpassing prior methods' performance with much less computational cost.

Contribution

The authors propose Noise-FGSM, a novel single-step adversarial training approach that avoids catastrophic overfitting through stronger noise and no clipping, offering a faster alternative to existing regularizers.

Findings

01

N-FGSM prevents catastrophic overfitting at large perturbation radii.

02

N-FGSM matches or exceeds GradAlign's robustness performance.

03

N-FGSM achieves 3x speed-up over previous state-of-the-art methods.

Abstract

Recently, Wong et al. showed that adversarial training with single-step FGSM leads to a characteristic failure mode named Catastrophic Overfitting (CO), in which a model becomes suddenly vulnerable to multi-step attacks. Experimentally they showed that simply adding a random perturbation prior to FGSM (RS-FGSM) could prevent CO. However, Andriushchenko and Flammarion observed that RS-FGSM still leads to CO for larger perturbations, and proposed a computationally expensive regularizer (GradAlign) to avoid it. In this work, we methodically revisit the role of noise and clipping in single-step adversarial training. Contrary to previous intuitions, we find that using a stronger noise around the clean sample combined with \textit{not clipping} is highly effective in avoiding CO for large perturbation radii. We then propose Noise-FGSM (N-FGSM) that, while providing the benefits of single-step…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pdejorge/n-fgsm
pytorchOfficial

Videos

Make Some Noise: Reliable and Efficient Single-Step Adversarial Training· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Integrated Circuits and Semiconductor Failure Analysis