Normalization Techniques in Training DNNs: Methodology, Analysis and Application
Lei Huang, Jie Qin, Yi Zhou, Fan Zhu, Li Liu, Ling Shao

TL;DR
This paper reviews the evolution, methodology, and applications of normalization techniques in training deep neural networks, providing a unified framework and insights for designing new methods and understanding their role in optimization.
Contribution
It offers a comprehensive taxonomy and analysis of normalization methods, decomposing their components and discussing their applications across various tasks.
Findings
Unified view of normalization techniques from an optimization perspective
Decomposition of normalization methods into three core components
Guidance for designing new normalization approaches
Abstract
Normalization techniques are essential for accelerating the training and improving the generalization of deep neural networks (DNNs), and have successfully been used in various applications. This paper reviews and comments on the past, present and future of normalization methods in the context of DNN training. We provide a unified picture of the main motivation behind different approaches from the perspective of optimization, and present a taxonomy for understanding the similarities and differences between them. Specifically, we decompose the pipeline of the most representative normalizing activation methods into three components: the normalization area partitioning, normalization operation and normalization representation recovery. In doing so, we provide insight for designing new normalization technique. Finally, we discuss the current progress in understanding normalization methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
