Overfitting or Underfitting? Understand Robustness Drop in Adversarial Training

Zichao Li; Liyuan Liu; Chengyu Dong; Jingbo Shang

arXiv:2010.08034·cs.LG·October 19, 2020·1 cites

TL;DR

This paper investigates the cause of robustness drop in adversarial training, identifying perturbation underfitting as the main factor, and proposes APART, an adaptive framework that improves robustness efficiently.

Contribution

The paper shows that the robustness drop stems from underfitting of the perturbations rather than overfitting of the model, and it introduces APART, an adaptive adversarial training method that improves robustness at a lower computational cost.

Findings

01. APART achieves comparable or better robustness than PGD-10.

02. APART reduces the computational cost to about one-quarter of that of PGD-10.

03. Perturbation underfitting, not overfitting, causes the robustness drop.

Abstract

Our goal is to understand why robustness drops after conducting adversarial training for too long. Although this phenomenon is commonly explained as overfitting, our analysis suggests that its primary cause is perturbation underfitting. We observe that after training for too long, FGSM-generated perturbations deteriorate into random noise. Intuitively, since no parameter updates are made to strengthen the perturbation generator, once this process collapses, it could be trapped in such local optima. Also, making this process more sophisticated could mostly avoid the robustness drop, which supports the view that this phenomenon is caused by underfitting instead of overfitting. In light of our analyses, we propose APART, an adaptive adversarial training framework, which parameterizes perturbation generation and progressively strengthens it. Shielding perturbations from underfitting unleashes the…
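To make the abstract's contrast between an FGSM perturbation and random noise concrete, here is a minimal sketch. It is not the paper's setup: it assumes a toy logistic-regression model with hand-picked random weights, and the names `fgsm_perturbation`, `input_gradient`, and `epsilon` are illustrative. It compares the loss under a single-step FGSM perturbation with the loss under random sign noise of the same L-infinity size.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bce_loss(w, b, x, y):
    # Binary cross-entropy of a logistic-regression model on one example.
    p = sigmoid(x @ w + b)
    tiny = 1e-12  # guards against log(0)
    return float(-(y * np.log(p + tiny) + (1 - y) * np.log(1 - p + tiny)))

def input_gradient(w, b, x, y):
    # Gradient of the loss with respect to the input x.
    # For logistic regression this has the closed form (p - y) * w.
    return (sigmoid(x @ w + b) - y) * w

def fgsm_perturbation(w, b, x, y, epsilon):
    # Single-step FGSM: move each coordinate by epsilon in the
    # direction that increases the loss.
    return epsilon * np.sign(input_gradient(w, b, x, y))

# Toy model and example (assumed values, for illustration only).
rng = np.random.default_rng(0)
w = rng.normal(size=8)
b = 0.1
x = rng.normal(size=8)
y = 1.0
epsilon = 0.1

delta_fgsm = fgsm_perturbation(w, b, x, y, epsilon)
# Random sign noise with the same L-infinity norm as the FGSM perturbation.
delta_rand = epsilon * rng.choice([-1.0, 1.0], size=8)

clean_loss = bce_loss(w, b, x, y)
fgsm_loss = bce_loss(w, b, x + delta_fgsm, y)
rand_loss = bce_loss(w, b, x + delta_rand, y)

print(f"clean: {clean_loss:.4f}  fgsm: {fgsm_loss:.4f}  random: {rand_loss:.4f}")
```

For this linear toy model the FGSM direction is the worst case within the epsilon ball, so random noise of the same size can never raise the loss more. A perturbation generator that has degraded toward random noise therefore gives the model much weaker adversarial examples to train on, which is the sense of "underfitting" the abstract describes.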

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications