Compressing Neural Networks Using Tensor Networks with Exponentially Fewer Variational Parameters

Yong Qing; Ke Li; Peng-Fei Zhou; Shi-Ju Ran

arXiv:2305.06058·cs.LG·May 23, 2025·1 cites

Compressing Neural Networks Using Tensor Networks with Exponentially Fewer Variational Parameters

Yong Qing, Ke Li, Peng-Fei Zhou, Shi-Ju Ran

PDF

Open Access

TL;DR

This paper introduces a novel neural network compression method using deep tensor networks that drastically reduces variational parameters while maintaining or improving accuracy on standard datasets.

Contribution

The work presents a deep automically differentiable tensor network (ADTN) approach for neural network compression, achieving exponentially fewer parameters than traditional methods.

Findings

01

Compresses large neural networks to thousands of parameters.

02

Maintains or improves accuracy on datasets like CIFAR-10.

03

Demonstrates superior compression compared to existing tensor-based methods.

Abstract

Neural network (NN) designed for challenging machine learning tasks is in general a highly nonlinear mapping that contains massive variational parameters. High complexity of NN, if unbounded or unconstrained, might unpredictably cause severe issues including \R{overfitting}, loss of generalization power, and unbearable cost of hardware. In this work, we propose a general compression scheme that significantly reduces the variational parameters of NN's, despite of their specific types (linear, convolutional, \textit{etc}), by encoding them to deep \R{automatically differentiable} tensor network (ADTN) that contains exponentially-fewer free parameters. Superior compression performance of our scheme is demonstrated on several widely-recognized NN's (FC-2, LeNet-5, AlextNet, ZFNet and VGG-16) and datasets (MNIST, CIFAR-10 and CIFAR-100). For instance, we compress two linear layers in VGG-16…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Parallel Computing and Optimization Techniques

MethodsMax Pooling · Dropout · Softmax · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Local Contrast Normalization · Dense Connections · ZFNet