Understanding Robust Overfitting from the Feature Generalization   Perspective

Chaojian Yu; Xiaolong Shi; Jun Yu; Bo Han; Tongliang Liu

arXiv:2310.00607·cs.LG·July 30, 2024

Understanding Robust Overfitting from the Feature Generalization Perspective

Chaojian Yu, Xiaolong Shi, Jun Yu, Bo Han, Tongliang Liu

PDF

Open Access

TL;DR

This paper investigates robust overfitting in adversarial training from a feature generalization perspective, identifying natural data as the main cause and proposing methods to mitigate it, thereby improving robustness.

Contribution

It introduces a novel feature generalization perspective on robust overfitting and proposes two methods to prevent feature degradation during adversarial training.

Findings

01

Natural data causes robust overfitting.

02

Proposed methods effectively mitigate overfitting.

03

Enhanced adversarial robustness demonstrated on benchmarks.

Abstract

Adversarial training (AT) constructs robust neural networks by incorporating adversarial perturbations into natural data. However, it is plagued by the issue of robust overfitting (RO), which severely damages the model's robustness. In this paper, we investigate RO from a novel feature generalization perspective. Specifically, we design factor ablation experiments to assess the respective impacts of natural data and adversarial perturbations on RO, identifying that the inducing factor of RO stems from natural data. Given that the only difference between adversarial and natural training lies in the inclusion of adversarial perturbations, we further hypothesize that adversarial perturbations degrade the generalization of features in natural data and verify this hypothesis through extensive experiments. Based on these findings, we provide a holistic view of RO from the feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis