"BNN - BN = ?": Training Binary Neural Networks without Batch Normalization
Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen,, Zhangyang Wang

TL;DR
This paper introduces a novel method to train binary neural networks without batch normalization, reducing computational costs and sample dependence while maintaining competitive accuracy across multiple datasets.
Contribution
The authors demonstrate for the first time that batch normalization can be completely removed from binary neural network training and inference, using specialized techniques to preserve performance.
Findings
Achieved over 92% accuracy on CIFAR-10 without batch normalization.
Maintained competitive accuracy on ImageNet with marginal performance drop.
Validated the approach across diverse BNN architectures and datasets.
Abstract
Batch normalization (BN) is a key facilitator and considered essential for state-of-the-art binary neural networks (BNN). However, the BN layer is costly to calculate and is typically implemented with non-binary parameters, leaving a hurdle for the efficient implementation of BNN training. It also introduces undesirable dependence between samples within each batch. Inspired by the latest advance on Batch Normalization Free (BN-Free) training, we extend their framework to training BNNs, and for the first time demonstrate that BNs can be completed removed from BNN training and inference regimes. By plugging in and customizing techniques including adaptive gradient clipping, scale weight standardization, and specialized bottleneck block, a BN-free BNN is capable of maintaining competitive accuracy compared to its BN-based counterpart. Extensive experiments validate the effectiveness of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsBatch Normalization
