Conserve-Update-Revise to Cure Generalization and Robustness Trade-off   in Adversarial Training

Shruthi Gowda; Bahram Zonooz; Elahe Arani

arXiv:2401.14948·cs.LG·January 29, 2024·1 cites

Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training

Shruthi Gowda, Bahram Zonooz, Elahe Arani

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces CURE, a novel training framework that selectively updates neural network layers to improve robustness against adversarial attacks while maintaining generalization, addressing the robustness-generalization trade-off.

Contribution

The paper proposes CURE, a dataset- and architecture-agnostic method using gradient prominence for selective weight updates to enhance robustness and generalization in adversarial training.

Findings

01

CURE improves robustness and reduces overfitting.

02

Selective layer updating enhances learning capacity.

03

Mitigates robust overfitting across datasets and architectures.

Abstract

Adversarial training improves the robustness of neural networks against adversarial attacks, albeit at the expense of the trade-off between standard and robust generalization. To unveil the underlying factors driving this phenomenon, we examine the layer-wise learning capabilities of neural networks during the transition from a standard to an adversarial setting. Our empirical findings demonstrate that selectively updating specific layers while preserving others can substantially enhance the network's learning capacity. We therefore propose CURE, a novel training framework that leverages a gradient prominence criterion to perform selective conservation, updating, and revision of weights. Importantly, CURE is designed to be dataset- and architecture-agnostic, ensuring its applicability across various scenarios. It effectively tackles both memorization and overfitting issues, thus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neurai-lab/cure
pytorchOfficial

Videos

Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications