Wide Compression: Tensor Ring Nets
Wenqi Wang, Yifan Sun, Brian Eriksson, Wenlin Wang, Vaneet, Aggarwal

TL;DR
Tensor Ring Networks (TR-Nets) leverage tensor ring factorization to significantly compress deep neural networks, reducing memory and computation needs while maintaining accuracy, thus enabling deployment on resource-constrained devices.
Contribution
The paper introduces TR-Nets, a novel tensor ring factorization-based method for compressing both fully connected and convolutional layers in deep neural networks.
Findings
Compressed LeNet-5 by 11x without accuracy loss
Achieved 243x compression of Wide ResNet with 2.3% accuracy degradation
Demonstrated potential for resource-constrained device deployment
Abstract
Deep neural networks have demonstrated state-of-the-art performance in a variety of real-world applications. In order to obtain performance gains, these networks have grown larger and deeper, containing millions or even billions of parameters and over a thousand layers. The trade-off is that these large architectures require an enormous amount of memory, storage, and computation, thus limiting their usability. Inspired by the recent tensor ring factorization, we introduce Tensor Ring Networks (TR-Nets), which significantly compress both the fully connected layers and the convolutional layers of deep neural networks. Our results show that our TR-Nets approach {is able to compress LeNet-5 by without losing accuracy}, and can compress the state-of-the-art Wide ResNet by with only 2.3\% degradation in {Cifar10 image classification}. Overall, this compression scheme…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Advanced Neural Network Applications · Parallel Computing and Optimization Techniques
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection
