Boundary Adversarial Examples Against Adversarial Overfitting

Muhammad Zaid Hameed; Beat Buesser

arXiv:2211.14088 · cs.LG · November 28, 2022

TL;DR

This paper investigates the causes of robust overfitting in adversarial training, evaluates mitigation strategies, and introduces helper adversarial examples to improve clean accuracy without sacrificing robustness.

Contribution

The paper provides insights into robust overfitting, assesses existing mitigation methods, and proposes a novel approach that uses helper adversarial examples to enhance adversarial training.

Findings

1. Mitigation strategies can reduce overfitting but often lower clean accuracy.

2. Helper adversarial examples improve clean accuracy without harming robustness.

3. Mitigation approaches are potentially complementary when combined with helper examples.

Abstract

Standard adversarial training approaches suffer from robust overfitting, where robust accuracy decreases when models are adversarially trained for too long. The origin of this problem is still unclear, and conflicting explanations have been reported: memorization effects induced by large-loss data on the one hand, and small-loss data together with growing differences in the loss distribution of training samples as adversarial training progresses on the other. Consequently, several mitigation approaches have been proposed, including early stopping, temporal ensembling, and weight perturbations on small-loss data. However, a side effect of these strategies is a larger reduction in clean accuracy compared to standard adversarial training. In this paper, we investigate if these mitigation approaches are complementary to each other in improving adversarial training…
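To make the idea of early stopping as a mitigation for robust overfitting concrete, here is a minimal sketch in plain Python. It is not the authors' procedure: the function name `early_stopping_epoch`, the `patience` parameter, and the accuracy values are all hypothetical and chosen only for illustration. The sketch assumes that robust accuracy on a held-out set has been measured after every epoch, and it returns the epoch whose checkpoint would be kept.

```python
from typing import Sequence, Tuple


def early_stopping_epoch(
    robust_acc: Sequence[float], patience: int = 3
) -> Tuple[int, float]:
    """Return (best_epoch, best_accuracy) under patience-based early stopping.

    robust_acc[i] is assumed to be the robust validation accuracy measured
    after epoch i. Training is considered stopped once `patience` epochs
    pass without a new best value.
    """
    if not robust_acc:
        raise ValueError("robust_acc must contain at least one epoch")
    best_epoch, best_acc = 0, robust_acc[0]
    for epoch, acc in enumerate(robust_acc):
        if acc > best_acc:
            # New best robust accuracy: this checkpoint would be saved.
            best_epoch, best_acc = epoch, acc
        elif epoch - best_epoch >= patience:
            # No improvement for `patience` epochs: stop training here.
            break
    return best_epoch, best_acc


# Made-up curve for illustration only (not results from the paper):
# robust accuracy rises, peaks, then decays as training continues.
toy_curve = [0.30, 0.42, 0.48, 0.51, 0.50, 0.49, 0.47, 0.46]
best_epoch, best_acc = early_stopping_epoch(toy_curve, patience=3)
print(best_epoch, best_acc)
```

On this toy curve the selected checkpoint is the peak before the decline, which is the behavior early stopping relies on. A small `patience` can also stop before a later recovery, so the value is a judgment call rather than a fixed rule.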

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)