Canonical convolutional neural networks
Lokesh Veeramacheneni, Moritz Wolter, Reinhard Klein, Jochen, Garcke

TL;DR
This paper introduces a canonical weight normalization method for convolutional neural networks inspired by tensor decomposition, enabling effective training, initialization, and model compression.
Contribution
It proposes a novel canonical re-parameterization for CNN weights, simplifying training and enabling efficient model compression compared to traditional methods.
Findings
Competitive normalization performance on MNIST, CIFAR10, SVHN.
Cheaper initialization methods can replace the power method.
Simplifies network compression through truncation of parameter sums.
Abstract
We introduce canonical weight normalization for convolutional neural networks. Inspired by the canonical tensor decomposition, we express the weight tensors in so-called canonical networks as scaled sums of outer vector products. In particular, we train network weights in the decomposed form, where scale weights are optimized separately for each mode. Additionally, similarly to weight normalization, we include a global scaling parameter. We study the initialization of the canonical form by running the power method and by drawing randomly from Gaussian or uniform distributions. Our results indicate that we can replace the power method with cheaper initializations drawn from standard distributions. The canonical re-parametrization leads to competitive normalization performance on the MNIST, CIFAR10, and SVHN data sets. Moreover, the formulation simplifies network compression. Once…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Advanced Neural Network Applications · Computational Physics and Python Applications
MethodsWeight Normalization
