Batch Layer Normalization, A new normalization layer for CNNs and RNN

Amir Ziaee; Erion \c{C}ano

arXiv:2209.08898·cs.LG·January 16, 2023·1 cites

Batch Layer Normalization, A new normalization layer for CNNs and RNN

Amir Ziaee, Erion \c{C}ano

PDF

Open Access 1 Repo

TL;DR

This paper proposes Batch Layer Normalization (BLN), a new normalization layer for CNNs and RNNs that adaptively combines batch and layer normalization, improving convergence and flexibility during training.

Contribution

BLN introduces a hybrid normalization method that dynamically balances batch and feature normalization, enhancing training stability and hyper-parameter tuning.

Findings

01

BLN converges faster than batch and layer normalization.

02

BLN is effective for both CNNs and RNNs.

03

BLN supports theoretical analysis and task-dependent configuration.

Abstract

This study introduces a new normalization layer termed Batch Layer Normalization (BLN) to reduce the problem of internal covariate shift in deep neural network layers. As a combined version of batch and layer normalization, BLN adaptively puts appropriate weight on mini-batch and feature normalization based on the inverse size of mini-batches to normalize the input to a layer during the learning process. It also performs the exact computation with a minor change at inference times, using either mini-batch statistics or population statistics. The decision process to either use statistics of mini-batch or population gives BLN the ability to play a comprehensive role in the hyper-parameter optimization process of models. The key advantage of BLN is the support of the theoretical analysis of being independent of the input data, and its statistical configuration heavily depends on the task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

a2amir/batch-layer-normalization
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition

MethodsTest · Batch Normalization · Layer Normalization