Do Normalization Layers in a Deep ConvNet Really Need to Be Distinct?
Ping Luo, Zhanglin Peng, Jiamin Ren, Ruimao Zhang

TL;DR
This paper investigates whether different normalization layers in a ConvNet should use distinct normalizers, finding that allowing layer-specific choices improves learning and generalization, influenced mainly by depth and batch size.
Contribution
It introduces a switchable normalization approach enabling each layer to select its own normalizer, revealing the importance of layer-specific normalization choices in deep networks.
Findings
Distinct normalizers enhance learning and generalization.
Normalizer choices depend on depth and batch size.
Different datasets and tasks prefer different normalizers.
Abstract
Yes, they do. This work investigates a perspective for deep learning: whether different normalization layers in a ConvNet require different normalizers. This is the first step towards understanding this phenomenon. We allow each convolutional layer to be stacked before a switchable normalization (SN) that learns to choose a normalizer from a pool of normalization methods. Through systematic experiments in ImageNet, COCO, Cityscapes, and ADE20K, we answer three questions: (a) Is it useful to allow each normalization layer to select its own normalizer? (b) What impacts the choices of normalizers? (c) Do different tasks and datasets prefer different normalizers? Our results suggest that (1) using distinct normalizers improves both learning and generalization of a ConvNet; (2) the choices of normalizers are more related to depth and batch size, but less relevant to parameter initialization,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification
MethodsInstance Normalization · Layer Normalization · Softmax · Batch Normalization · Switchable Normalization
