Kernel Normalized Convolutional Networks
Reza Nasirigerdeh, Reihaneh Torkzadehmahani, Daniel Rueckert, Georgios, Kaissis

TL;DR
KernelNorm introduces a batch-independent normalization method for convolutional networks, enabling effective training without BatchNorm, especially beneficial for small batch sizes and differential privacy, while maintaining high performance.
Contribution
The paper proposes KernelNorm and kernel normalized convolutional layers, creating KNConvNets that outperform BatchNorm-based models in various tasks without relying on batch-dependent normalization.
Findings
KNConvNets achieve higher or comparable accuracy to BatchNorm models.
They outperform layer and group normalization in private and non-private settings.
KernelNorm effectively combines batch-independence with high performance.
Abstract
Existing convolutional neural network architectures frequently rely upon batch normalization (BatchNorm) to effectively train the model. BatchNorm, however, performs poorly with small batch sizes, and is inapplicable to differential privacy. To address these limitations, we propose the kernel normalization (KernelNorm) and kernel normalized convolutional layers, and incorporate them into kernel normalized convolutional networks (KNConvNets) as the main building blocks. We implement KNConvNets corresponding to the state-of-the-art ResNets while forgoing the BatchNorm layers. Through extensive experiments, we illustrate that KNConvNets achieve higher or competitive performance compared to the BatchNorm counterparts in image classification and semantic segmentation. They also significantly outperform their batch-independent competitors including those based on layer and group normalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsConcatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Residual Connection · Bottleneck Residual Block · Convolution · Softmax · Average Pooling · Residual Block · Dense Connections
