Batch Normalization Increases Adversarial Vulnerability and Decreases Adversarial Transferability: A Non-Robust Feature Perspective
Philipp Benz, Chaoning Zhang, In So Kweon

TL;DR
This paper investigates how batch normalization affects deep neural networks by increasing reliance on non-robust features, which enhances accuracy but reduces adversarial robustness and transferability, providing new insights into model robustness dynamics.
Contribution
It introduces a framework to disentangle robustness and usefulness, revealing BN's tendency to favor non-robust features and analyzing the learning process of RFs and NRFs.
Findings
BN shifts models towards non-robust features
RFs transfer better than NRFs in adversarial settings
Framework reveals the sequential learning of RFs then NRFs
Abstract
Batch normalization (BN) has been widely used in modern deep neural networks (DNNs) due to improved convergence. BN is observed to increase the model accuracy while at the cost of adversarial robustness. There is an increasing interest in the ML community to understand the impact of BN on DNNs, especially related to the model robustness. This work attempts to understand the impact of BN on DNNs from a non-robust feature perspective. Straightforwardly, the improved accuracy can be attributed to the better utilization of useful features. It remains unclear whether BN mainly favors learning robust features (RFs) or non-robust features (NRFs). Our work presents empirical evidence that supports that BN shifts a model towards being more dependent on NRFs. To facilitate the analysis of such a feature robustness shift, we propose a framework for disentangling robust usefulness into robustness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications
