Compressing Neural Networks Using Tensor Networks with Exponentially Fewer Variational Parameters
Yong Qing, Ke Li, Peng-Fei Zhou, Shi-Ju Ran

TL;DR
This paper introduces a novel neural network compression method using deep tensor networks that drastically reduces variational parameters while maintaining or improving accuracy on standard datasets.
Contribution
The work presents a deep automically differentiable tensor network (ADTN) approach for neural network compression, achieving exponentially fewer parameters than traditional methods.
Findings
Compresses large neural networks to thousands of parameters.
Maintains or improves accuracy on datasets like CIFAR-10.
Demonstrates superior compression compared to existing tensor-based methods.
Abstract
Neural network (NN) designed for challenging machine learning tasks is in general a highly nonlinear mapping that contains massive variational parameters. High complexity of NN, if unbounded or unconstrained, might unpredictably cause severe issues including \R{overfitting}, loss of generalization power, and unbearable cost of hardware. In this work, we propose a general compression scheme that significantly reduces the variational parameters of NN's, despite of their specific types (linear, convolutional, \textit{etc}), by encoding them to deep \R{automatically differentiable} tensor network (ADTN) that contains exponentially-fewer free parameters. Superior compression performance of our scheme is demonstrated on several widely-recognized NN's (FC-2, LeNet-5, AlextNet, ZFNet and VGG-16) and datasets (MNIST, CIFAR-10 and CIFAR-100). For instance, we compress two linear layers in VGG-16…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Parallel Computing and Optimization Techniques
MethodsMax Pooling · Dropout · Softmax · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Local Contrast Normalization · Dense Connections · ZFNet
