Feature Prioritization and Regularization Improve Standard Accuracy and Adversarial Robustness
Chihuang Liu, Joseph JaJa

TL;DR
This paper introduces a model that combines feature prioritization via an attention mechanism and feature regularization to simultaneously enhance adversarial robustness and maintain standard accuracy, addressing the typical trade-off.
Contribution
The proposed approach employs a nonlinear attention module and $L_2$ regularization to improve both robustness and accuracy, with theoretical justification and extensive experimental validation.
Findings
Improved adversarial robustness over baseline models.
Maintained or enhanced standard classification accuracy.
Model aligns well with salient data features.
Abstract
Adversarial training has been successfully applied to build robust models at a certain cost. While the robustness of a model increases, the standard classification accuracy declines. This phenomenon is suggested to be an inherent trade-off. We propose a model that employs feature prioritization by a nonlinear attention module and feature regularization to improve the adversarial robustness and the standard accuracy relative to adversarial training. The attention module encourages the model to rely heavily on robust features by assigning larger weights to them while suppressing non-robust features. The regularizer encourages the model to extract similar features for the natural and adversarial images, effectively ignoring the added perturbation. In addition to evaluating the robustness of our model, we provide justification for the attention module and propose a novel experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis
