Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor   Decomposition

Miao Yin; Siyu Liao; Xiao-Yang Liu; Xiaodong Wang; Bo Yuan

arXiv:2005.04366·cs.LG·May 12, 2020·21 cites

Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition

Miao Yin, Siyu Liao, Xiao-Yang Liu, Xiaodong Wang, Bo Yuan

PDF

Open Access

TL;DR

This paper introduces a Hierarchical Tucker tensor decomposition method to compress RNNs, significantly improving model size reduction and accuracy compared to existing tensor-based approaches.

Contribution

The paper proposes a novel HT decomposition-based RNN compression method that enhances representation capability and achieves superior compression and accuracy.

Findings

01

HT-LSTM outperforms TT-LSTM, TR-LSTM, and BT-LSTM in compression ratio.

02

HT-LSTM maintains or improves test accuracy across datasets.

03

Hierarchical structure enhances model expressiveness.

Abstract

Recurrent Neural Networks (RNNs) have been widely used in sequence analysis and modeling. However, when processing high-dimensional data, RNNs typically require very large model sizes, thereby bringing a series of deployment challenges. Although the state-of-the-art tensor decomposition approaches can provide good model compression performance, these existing methods are still suffering some inherent limitations, such as restricted representation capability and insufficient model complexity reduction. To overcome these limitations, in this paper we propose to develop compact RNN models using Hierarchical Tucker (HT) decomposition. HT decomposition brings strong hierarchical structure to the decomposed RNN models, which is very useful and important for enhancing the representation capability. Meanwhile, HT decomposition provides higher storage and computational cost reduction than the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications

MethodsTuckER · Sigmoid Activation · Tanh Activation · Long Short-Term Memory