Batch Normalization Sampling

Zhaodong Chen; Lei Deng; Guoqi Li; Jiawei Sun; Xing Hu; Xin Ma; Yuan; Xie

arXiv:1810.10962·cs.LG·November 5, 2018·1 cites

Batch Normalization Sampling

Zhaodong Chen, Lei Deng, Guoqi Li, Jiawei Sun, Xing Hu, Xin Ma, Yuan, Xie

PDF

Open Access

TL;DR

This paper introduces sampling-based strategies for Batch Normalization in deep neural networks, significantly reducing computational costs while maintaining accuracy, and extends the approach to micro-batch normalization for small batch sizes.

Contribution

It proposes novel sampling strategies and a simple Virtual Dataset Normalization method to improve BN efficiency without sacrificing performance.

Findings

01

Up to 20% training speedup on GPU

02

Negligible impact on accuracy and convergence

03

Effective for micro-batch normalization with tiny batch sizes

Abstract

Deep Neural Networks (DNNs) thrive in recent years in which Batch Normalization (BN) plays an indispensable role. However, it has been observed that BN is costly due to the reduction operations. In this paper, we propose alleviating this problem through sampling only a small fraction of data for normalization at each iteration. Specifically, we model it as a statistical sampling problem and identify that by sampling less correlated data, we can largely reduce the requirement of the number of data for statistics estimation in BN, which directly simplifies the reduction operations. Based on this conclusion, we propose two sampling strategies, "Batch Sampling" (randomly select several samples from each batch) and "Feature Sampling" (randomly select a small patch from each feature map of all samples), that take both computational efficiency and sample correlation into consideration.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification