Exploiting Local Structures with the Kronecker Layer in Convolutional Networks
Shuchang Zhou, Jia-Nan Wu, Yuxin Wu, Xinyu Zhou

TL;DR
This paper introduces a Kronecker layer technique for convolutional neural networks that reduces parameters and computation, enabling faster training and better performance on various datasets.
Contribution
It generalizes low-rank approximations using Kronecker products to improve CNN efficiency and modeling capacity, outperforming previous methods.
Findings
Achieves 3.3x speedup or 3.6x parameter reduction with minimal accuracy loss.
Enables larger feature maps and outperforms state-of-the-art on SVHN and CASIA-HWDB.
Demonstrates effectiveness across SVHN, scene text recognition, and ImageNet datasets.
Abstract
In this paper, we propose and study a technique to reduce the number of parameters and computation time in convolutional neural networks. We use Kronecker product to exploit the local structures within convolution and fully-connected layers, by replacing the large weight matrices by combinations of multiple Kronecker products of smaller matrices. Just as the Kronecker product is a generalization of the outer product from vectors to matrices, our method is a generalization of the low rank approximation method for convolution neural networks. We also introduce combinations of different shapes of Kronecker product to increase modeling capacity. Experiments on SVHN, scene text recognition and ImageNet dataset demonstrate that we can achieve speedup or parameter reduction with less than 1\% drop in accuracy, showing the effectiveness and efficiency of our method.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques
MethodsConvolution
