Compressing 3DCNNs Based on Tensor Train Decomposition
Dingheng Wang, Guangshe Zhao, Guoqi Li, Lei Deng, Yang Wu

TL;DR
This paper introduces a tensor train decomposition method to significantly compress 3D convolutional neural networks, enabling their deployment in resource-constrained environments while maintaining accuracy.
Contribution
The work proposes a novel tensor train format for 3D convolutional kernels and explores rank selection for effective compression of 3DCNNs.
Findings
Achieves around 100x compression ratio without significant accuracy loss
Demonstrates effectiveness on VIVA, UCF11, and UCF101 datasets
Provides insights into tensor train rank selection and computational complexity
Abstract
Three dimensional convolutional neural networks (3DCNNs) have been applied in many tasks, e.g., video and 3D point cloud recognition. However, due to the higher dimension of convolutional kernels, the space complexity of 3DCNNs is generally larger than that of traditional two dimensional convolutional neural networks (2DCNNs). To miniaturize 3DCNNs for the deployment in confining environments such as embedded devices, neural network compression is a promising approach. In this work, we adopt the tensor train (TT) decomposition, a straightforward and simple in situ training compression method, to shrink the 3DCNN models. Through proposing tensorizing 3D convolutional kernels in TT format, we investigate how to select appropriate TT ranks for achieving higher compression ratio. We have also discussed the redundancy of 3D convolutional kernels for compression, core significance and future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
