Towards Extremely Compact RNNs for Video Recognition with Fully   Decomposed Hierarchical Tucker Structure

Miao Yin; Siyu Liao; Xiao-Yang Liu; Xiaodong Wang; Bo Yuan

arXiv:2104.05758·cs.CV·April 21, 2021·5 cites

Towards Extremely Compact RNNs for Video Recognition with Fully Decomposed Hierarchical Tucker Structure

Miao Yin, Siyu Liao, Xiao-Yang Liu, Xiaodong Wang, Bo Yuan

PDF

Open Access

TL;DR

This paper introduces a fully decomposed hierarchical Tucker (FDHT) structure for RNNs, enabling extremely compact models that maintain high accuracy, particularly for video recognition tasks, with significantly fewer parameters than existing methods.

Contribution

The paper proposes a novel fully decomposition approach using hierarchical Tucker structure for RNNs, achieving superior compression and accuracy over prior tensor decomposition methods.

Findings

01

FDHT-LSTM achieves high accuracy with only a few thousand parameters.

02

Our method reduces model size by over 10,000 times compared to previous models.

03

FDHT-LSTM outperforms state-of-the-art compressed RNNs in both size and accuracy.

Abstract

Recurrent Neural Networks (RNNs) have been widely used in sequence analysis and modeling. However, when processing high-dimensional data, RNNs typically require very large model sizes, thereby bringing a series of deployment challenges. Although various prior works have been proposed to reduce the RNN model sizes, executing RNN models in resource-restricted environments is still a very challenging problem. In this paper, we propose to develop extremely compact RNN models with fully decomposed hierarchical Tucker (FDHT) structure. The HT decomposition does not only provide much higher storage cost reduction than the other tensor decomposition approaches but also brings better accuracy performance improvement for the compact RNN models. Meanwhile, unlike the existing tensor decomposition-based methods that can only decompose the input-to-hidden layer of RNNs, our proposed fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Advanced Neural Network Applications · Human Pose and Action Recognition

MethodsTuckER · Tanh Activation · Sigmoid Activation · Long Short-Term Memory