Kernel Normalized Convolutional Networks

Reza Nasirigerdeh; Reihaneh Torkzadehmahani; Daniel Rueckert; Georgios; Kaissis

arXiv:2205.10089·cs.LG·March 6, 2024·1 cites

Kernel Normalized Convolutional Networks

Reza Nasirigerdeh, Reihaneh Torkzadehmahani, Daniel Rueckert, Georgios, Kaissis

PDF

Open Access 1 Repo

TL;DR

KernelNorm introduces a batch-independent normalization method for convolutional networks, enabling effective training without BatchNorm, especially beneficial for small batch sizes and differential privacy, while maintaining high performance.

Contribution

The paper proposes KernelNorm and kernel normalized convolutional layers, creating KNConvNets that outperform BatchNorm-based models in various tasks without relying on batch-dependent normalization.

Findings

01

KNConvNets achieve higher or comparable accuracy to BatchNorm models.

02

They outperform layer and group normalization in private and non-private settings.

03

KernelNorm effectively combines batch-independence with high performance.

Abstract

Existing convolutional neural network architectures frequently rely upon batch normalization (BatchNorm) to effectively train the model. BatchNorm, however, performs poorly with small batch sizes, and is inapplicable to differential privacy. To address these limitations, we propose the kernel normalization (KernelNorm) and kernel normalized convolutional layers, and incorporate them into kernel normalized convolutional networks (KNConvNets) as the main building blocks. We implement KNConvNets corresponding to the state-of-the-art ResNets while forgoing the BatchNorm layers. Through extensive experiments, we illustrate that KNConvNets achieve higher or competitive performance compared to the BatchNorm counterparts in image classification and semantic segmentation. They also significantly outperform their batch-independent competitors including those based on layer and group normalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

reza-nasirigerdeh/norm-torch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsConcatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Residual Connection · Bottleneck Residual Block · Convolution · Softmax · Average Pooling · Residual Block · Dense Connections