Layer-Aware Analysis of Catastrophic Overfitting: Revealing the   Pseudo-Robust Shortcut Dependency

Runqi Lin; Chaojian Yu; Bo Han; Hang Su; Tongliang Liu

arXiv:2405.16262·cs.LG·September 17, 2024

Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency

Runqi Lin, Chaojian Yu, Bo Han, Hang Su, Tongliang Liu

PDF

Open Access 1 Repo

TL;DR

This paper investigates layer-specific changes during catastrophic overfitting in adversarial training, revealing pseudo-robust shortcuts in early layers and proposing a layer-aware perturbation method to mitigate CO and improve robustness.

Contribution

It uncovers the role of pseudo-robust shortcuts in early layers during CO and introduces LAP, a layer-aware perturbation technique to prevent CO in adversarial training.

Findings

01

LAP effectively prevents catastrophic overfitting.

02

Early layers are more susceptible to distortion during CO.

03

Removing shortcuts partially restores robustness.

Abstract

Catastrophic overfitting (CO) presents a significant challenge in single-step adversarial training (AT), manifesting as highly distorted deep neural networks (DNNs) that are vulnerable to multi-step adversarial attacks. However, the underlying factors that lead to the distortion of decision boundaries remain unclear. In this work, we delve into the specific changes within different DNN layers and discover that during CO, the former layers are more susceptible, experiencing earlier and greater distortion, while the latter layers show relative insensitivity. Our analysis further reveals that this increased sensitivity in former layers stems from the formation of pseudo-robust shortcuts, which alone can impeccably defend against single-step adversarial attacks but bypass genuine-robust learning, resulting in distorted decision boundaries. Eliminating these shortcuts can partially restore…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tmllab/2024_ICML_LAP
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection