Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks
Sofia Ivolgina, P. Thomas Fletcher, Baba C. Vemuri

TL;DR
This paper demonstrates that Stein shrinkage estimators improve batch normalization's robustness and accuracy, especially under adversarial attacks, leading to state-of-the-art results in image classification and segmentation tasks.
Contribution
It proves the dominance of Stein shrinkage estimators over traditional methods for BN parameters under adversarial conditions and shows improved robustness and performance in deep learning models.
Findings
Stein shrinkage estimators outperform sample mean and variance in adversarial settings.
James-Stein BN yields smaller Lipschitz constants, indicating better regularity.
Stein-corrected BN achieves SOTA results on multiple datasets.
Abstract
Batch normalization (BN) is a ubiquitous operation in deep neural networks, primarily used to improve stability and regularization during training. BN centers and scales feature maps using sample means and variances, which are naturally suited for Stein's shrinkage estimation. Applying such shrinkage yields more accurate mean and variance estimates of the batch in the mean-squared-error sense. In this paper, we prove that the Stein shrinkage estimator of the mean and variance dominates over the sample mean and variance estimators, respectively, in the presence of adversarial attacks modeled using sub-Gaussian distributions. Furthermore, by construction, the James-Stein (JS) BN yields a smaller local Lipschitz constant compared to the vanilla BN, implying better regularity properties and potentially improved robustness. This facilitates and justifies the application of Stein shrinkage to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Convolution · 3 Dimensional Convolutional Neural Network · HRNet
