Understanding Robust Overfitting from the Feature Generalization Perspective
Chaojian Yu, Xiaolong Shi, Jun Yu, Bo Han, Tongliang Liu

TL;DR
This paper investigates robust overfitting in adversarial training from a feature generalization perspective, identifying natural data as the main cause and proposing methods to mitigate it, thereby improving robustness.
Contribution
It introduces a novel feature generalization perspective on robust overfitting and proposes two methods to prevent feature degradation during adversarial training.
Findings
Natural data causes robust overfitting.
Proposed methods effectively mitigate overfitting.
Enhanced adversarial robustness demonstrated on benchmarks.
Abstract
Adversarial training (AT) constructs robust neural networks by incorporating adversarial perturbations into natural data. However, it is plagued by the issue of robust overfitting (RO), which severely damages the model's robustness. In this paper, we investigate RO from a novel feature generalization perspective. Specifically, we design factor ablation experiments to assess the respective impacts of natural data and adversarial perturbations on RO, identifying that the inducing factor of RO stems from natural data. Given that the only difference between adversarial and natural training lies in the inclusion of adversarial perturbations, we further hypothesize that adversarial perturbations degrade the generalization of features in natural data and verify this hypothesis through extensive experiments. Based on these findings, we provide a holistic view of RO from the feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis
