Wide Compression: Tensor Ring Nets

Wenqi Wang; Yifan Sun; Brian Eriksson; Wenlin Wang; Vaneet; Aggarwal

arXiv:1802.09052·cs.LG·February 27, 2018·27 cites

Wide Compression: Tensor Ring Nets

Wenqi Wang, Yifan Sun, Brian Eriksson, Wenlin Wang, Vaneet, Aggarwal

PDF

Open Access

TL;DR

Tensor Ring Networks (TR-Nets) leverage tensor ring factorization to significantly compress deep neural networks, reducing memory and computation needs while maintaining accuracy, thus enabling deployment on resource-constrained devices.

Contribution

The paper introduces TR-Nets, a novel tensor ring factorization-based method for compressing both fully connected and convolutional layers in deep neural networks.

Findings

01

Compressed LeNet-5 by 11x without accuracy loss

02

Achieved 243x compression of Wide ResNet with 2.3% accuracy degradation

03

Demonstrated potential for resource-constrained device deployment

Abstract

Deep neural networks have demonstrated state-of-the-art performance in a variety of real-world applications. In order to obtain performance gains, these networks have grown larger and deeper, containing millions or even billions of parameters and over a thousand layers. The trade-off is that these large architectures require an enormous amount of memory, storage, and computation, thus limiting their usability. Inspired by the recent tensor ring factorization, we introduce Tensor Ring Networks (TR-Nets), which significantly compress both the fully connected layers and the convolutional layers of deep neural networks. Our results show that our TR-Nets approach {is able to compress LeNet-5 by $11 \times$ without losing accuracy}, and can compress the state-of-the-art Wide ResNet by $243 \times$ with only 2.3\% degradation in {Cifar10 image classification}. Overall, this compression scheme…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Advanced Neural Network Applications · Parallel Computing and Optimization Techniques

MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection