Towards Understanding Dual BN In Hybrid Adversarial Training

Chenshuang Zhang; Chaoning Zhang; Kang Zhang; Axi Niu; Junmo Kim; In; So Kweon

arXiv:2403.19150·cs.LG·March 29, 2024·1 cites

Towards Understanding Dual BN In Hybrid Adversarial Training

Chenshuang Zhang, Chaoning Zhang, Kang Zhang, Axi Niu, Junmo Kim, In, So Kweon

PDF

Open Access

TL;DR

This paper investigates the role of Dual Batch Normalization in hybrid adversarial training, revealing that affine parameters are more critical than statistics for robustness and offering new insights into its underlying mechanisms.

Contribution

It challenges prior beliefs about the importance of normalization statistics, emphasizes the significance of affine parameters, and proposes a unified framework for improving hybrid adversarial training.

Findings

01

Disentangling affine parameters is more important than statistics.

02

The domain gap between adversarial and clean samples is smaller than expected.

03

Affine parameters influence robustness during inference.

Abstract

There is a growing concern about applying batch normalization (BN) in adversarial training (AT), especially when the model is trained on both adversarial samples and clean samples (termed Hybrid-AT). With the assumption that adversarial and clean samples are from two different domains, a common practice in prior works is to adopt Dual BN, where BN and BN are used for adversarial and clean branches, respectively. A popular belief for motivating Dual BN is that estimating normalization statistics of this mixture distribution is challenging and thus disentangling it for normalization achieves stronger robustness. In contrast to this belief, we reveal that disentangling statistics plays a less role than disentangling affine parameters in model training. This finding aligns with prior work (Rebuffi et al., 2023), and we build upon their research for further investigations. We demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · High-Velocity Impact and Material Behavior

MethodsBatch Normalization