Semi-tensor Product-based TensorDecomposition for Neural Network   Compression

Hengling Zhao; Yipeng Liu; Xiaolin Huang; Ce Zhu

arXiv:2109.15200·cs.LG·October 1, 2021·1 cites

Semi-tensor Product-based TensorDecomposition for Neural Network Compression

Hengling Zhao, Yipeng Liu, Xiaolin Huang, Ce Zhu

PDF

Open Access

TL;DR

This paper introduces semi-tensor product-based tensor decompositions to improve neural network compression, achieving higher compression ratios with minimal accuracy loss compared to traditional methods.

Contribution

It generalizes classical tensor decompositions using semi-tensor products, enabling more flexible and compact neural network compression techniques.

Findings

01

Semi-tensor tensor decompositions outperform traditional methods in compression ratio.

02

Higher compression factors achieved with similar accuracy.

03

Reduced training times for compressed models.

Abstract

The existing tensor networks adopt conventional matrix product for connection. The classical matrix product requires strict dimensionality consistency between factors, which can result in redundancy in data representation. In this paper, the semi-tensor product is used to generalize classical matrix product-based mode product to semi-tensor mode product. As it permits the connection of two factors with different dimensionality, more flexible and compact tensor decompositions can be obtained with smaller sizes of factors. Tucker decomposition, Tensor Train (TT) and Tensor Ring (TR) are common decomposition for low rank compression of deep neural networks. The semi-tensor product is applied to these tensor decompositions to obtained their generalized versions, i.e., semi-tensor Tucker decomposition (STTu), semi-tensor train(STT) and semi-tensor ring (STR). Experimental results show the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Advanced Neural Network Applications

Methods*Communicated@Fast*How Do I Communicate to Expedia? · TuckER · 1x1 Convolution · Residual Connection · Batch Normalization · Max Pooling · Residual Block · Kaiming Initialization · Bottleneck Residual Block · Average Pooling