Kronecker CP Decomposition with Fast Multiplication for Compressing RNNs
Dingheng Wang, Bijiao Wu, Guangshe Zhao, Man Yao, Hengnu, Chen, Lei Deng, Tianyi Yan, Guoqi Li

TL;DR
This paper introduces a novel Kronecker CP decomposition for RNN compression, achieving high compression ratios and improved efficiency in space and computation, with potential for parallel acceleration.
Contribution
The paper proposes the KCP decomposition and fast multiplication algorithms, enabling efficient RNN compression with high accuracy and superior performance over existing tensor formats.
Findings
278,219x compression ratio achieved
Comparable accuracy to other tensor formats
Enhanced space and computation efficiency
Abstract
Recurrent neural networks (RNNs) are powerful in the tasks oriented to sequential data, such as natural language processing and video recognition. However, since the modern RNNs, including long-short term memory (LSTM) and gated recurrent unit (GRU) networks, have complex topologies and expensive space/computation complexity, compressing them becomes a hot and promising topic in recent years. Among plenty of compression methods, tensor decomposition, e.g., tensor train (TT), block term (BT), tensor ring (TR) and hierarchical Tucker (HT), appears to be the most amazing approach since a very high compression ratio might be obtained. Nevertheless, none of these tensor decomposition formats can provide both the space and computation efficiency. In this paper, we consider to compress RNNs based on a novel Kronecker CANDECOMP/PARAFAC (KCP) decomposition, which is derived from Kronecker tensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTuckER
