Tensor train decompositions on recurrent networks

Alejandro Murua; Ramchalam Ramakrishnan; Xinlin Li; Rui Heng Yang,; Vahid Partovi Nia

arXiv:2006.05442·cs.LG·June 11, 2020·1 cites

Tensor train decompositions on recurrent networks

Alejandro Murua, Ramchalam Ramakrishnan, Xinlin Li, Rui Heng Yang,, Vahid Partovi Nia

PDF

Open Access

TL;DR

This paper advocates for using matrix product state (MPS) tensor trains to compress recurrent neural networks, especially LSTMs, offering better storage and inference efficiency, supported by theoretical and experimental evidence.

Contribution

It introduces MPS tensor trains as a superior method for RNN compression, highlighting their advantages over MPOs through analysis and NLP experiments.

Findings

01

MPS tensor trains outperform MPOs in storage efficiency.

02

MPS-based LSTM compression maintains performance with fewer parameters.

03

Theoretical analysis supports MPS advantages in RNN compression.

Abstract

Recurrent neural networks (RNN) such as long-short-term memory (LSTM) networks are essential in a multitude of daily live tasks such as speech, language, video, and multimodal learning. The shift from cloud to edge computation intensifies the need to contain the growth of RNN parameters. Current research on RNN shows that despite the performance obtained on convolutional neural networks (CNN), keeping a good performance in compressed RNNs is still a challenge. Most of the literature on compression focuses on CNNs using matrix product (MPO) operator tensor trains. However, matrix product state (MPS) tensor trains have more attractive features than MPOs, in terms of storage reduction and computing time at inference. We show that MPS tensor trains should be at the forefront of LSTM network compression through a theoretical analysis and practical experiments on NLP task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Parallel Computing and Optimization Techniques · Model Reduction and Neural Networks

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory