The Unreasonable Effectiveness of the Final Batch Normalization Layer
Veysel Kocaman, Ofer M. Shir, Thomas Baeck

TL;DR
This paper investigates how the final Batch Normalization layer influences the performance of deep learning models in highly imbalanced image classification tasks, revealing its significant impact and limitations under various conditions.
Contribution
It extends prior work by analyzing the effects of the final BN layer across different datasets, architectures, and imbalance ratios, providing new insights into its role and limitations.
Findings
Final BN layer improves performance in imbalanced settings.
The performance gain persists even after removing BN during inference.
Impact is limited to single majority class scenarios.
Abstract
Early-stage disease indications are rarely recorded in real-world domains, such as Agriculture and Healthcare, and yet, their accurate identification is critical in that point of time. In this type of highly imbalanced classification problems, which encompass complex features, deep learning (DL) is much needed because of its strong detection capabilities. At the same time, DL is observed in practice to favor majority over minority classes and consequently suffer from inaccurate detection of the targeted early-stage indications. In this work, we extend the study done by Kocaman et al., 2020, showing that the final BN layer, when placed before the softmax output layer, has a considerable impact in highly imbalanced image classification problems as well as undermines the role of the softmax outputs as an uncertainty measure. This current study addresses additional hypotheses and reports on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Imbalanced Data Classification Techniques · Artificial Intelligence in Healthcare
MethodsSoftmax · Sigmoid Activation
