Towards Understanding Dual BN In Hybrid Adversarial Training
Chenshuang Zhang, Chaoning Zhang, Kang Zhang, Axi Niu, Junmo Kim, In, So Kweon

TL;DR
This paper investigates the role of Dual Batch Normalization in hybrid adversarial training, revealing that affine parameters are more critical than statistics for robustness and offering new insights into its underlying mechanisms.
Contribution
It challenges prior beliefs about the importance of normalization statistics, emphasizes the significance of affine parameters, and proposes a unified framework for improving hybrid adversarial training.
Findings
Disentangling affine parameters is more important than statistics.
The domain gap between adversarial and clean samples is smaller than expected.
Affine parameters influence robustness during inference.
Abstract
There is a growing concern about applying batch normalization (BN) in adversarial training (AT), especially when the model is trained on both adversarial samples and clean samples (termed Hybrid-AT). With the assumption that adversarial and clean samples are from two different domains, a common practice in prior works is to adopt Dual BN, where BN and BN are used for adversarial and clean branches, respectively. A popular belief for motivating Dual BN is that estimating normalization statistics of this mixture distribution is challenging and thus disentangling it for normalization achieves stronger robustness. In contrast to this belief, we reveal that disentangling statistics plays a less role than disentangling affine parameters in model training. This finding aligns with prior work (Rebuffi et al., 2023), and we build upon their research for further investigations. We demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · High-Velocity Impact and Material Behavior
MethodsBatch Normalization
