Convolutional neural networks with low-rank regularization
Cheng Tai, Tong Xiao, Yi Zhang, Xiaogang Wang, Weinan E

TL;DR
This paper introduces a novel low-rank tensor decomposition algorithm for CNNs, enabling faster models with sometimes improved accuracy, demonstrated on multiple datasets and architectures.
Contribution
It develops an exact global optimizer for low-rank tensor decomposition and a new training method for low-rank CNNs from scratch.
Findings
Significant speedup in CNN inference times.
Improved accuracy on CIFAR-10 with low-rank models.
Effective across various architectures like AlexNet, VGG, and GoogleNet.
Abstract
Large CNNs have delivered impressive performance in various computer vision applications. But the storage and computation requirements make it problematic for deploying these models on mobile devices. Recently, tensor decompositions have been used for speeding up CNNs. In this paper, we further develop the tensor decomposition technique. We propose a new algorithm for computing the low-rank tensor decomposition for removing the redundancy in the convolution kernels. The algorithm finds the exact global optimizer of the decomposition and is more effective than iterative methods. Based on the decomposition, we further propose a new method for training low-rank constrained CNNs from scratch. Interestingly, while achieving a significant speedup, sometimes the low-rank constrained CNNs delivers significantly better performance than their non-constrained counterparts. On the CIFAR-10 dataset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Image and Signal Denoising Methods
Methods1x1 Convolution · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · Grouped Convolution · Dropout · How do I speak to a person at Expedia?-/+/ · GoogLeNet · Dense Connections
