Batch Normalization Sampling
Zhaodong Chen, Lei Deng, Guoqi Li, Jiawei Sun, Xing Hu, Xin Ma, Yuan, Xie

TL;DR
This paper introduces sampling-based strategies for Batch Normalization in deep neural networks, significantly reducing computational costs while maintaining accuracy, and extends the approach to micro-batch normalization for small batch sizes.
Contribution
It proposes novel sampling strategies and a simple Virtual Dataset Normalization method to improve BN efficiency without sacrificing performance.
Findings
Up to 20% training speedup on GPU
Negligible impact on accuracy and convergence
Effective for micro-batch normalization with tiny batch sizes
Abstract
Deep Neural Networks (DNNs) thrive in recent years in which Batch Normalization (BN) plays an indispensable role. However, it has been observed that BN is costly due to the reduction operations. In this paper, we propose alleviating this problem through sampling only a small fraction of data for normalization at each iteration. Specifically, we model it as a statistical sampling problem and identify that by sampling less correlated data, we can largely reduce the requirement of the number of data for statistics estimation in BN, which directly simplifies the reduction operations. Based on this conclusion, we propose two sampling strategies, "Batch Sampling" (randomly select several samples from each batch) and "Feature Sampling" (randomly select a small patch from each feature map of all samples), that take both computational efficiency and sample correlation into consideration.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
