A Fully Tensorized Recurrent Neural Network

Charles C. Onu; Jacob E. Miller; Doina Precup

arXiv:2010.04196·cs.LG·November 11, 2021·1 cites

A Fully Tensorized Recurrent Neural Network

Charles C. Onu, Jacob E. Miller, Doina Precup

PDF

Open Access 1 Repo

TL;DR

This paper introduces a fully tensorized RNN architecture using tensor-train factorization to significantly reduce model size and improve training stability without sacrificing performance.

Contribution

It presents a novel tensorized RNN design that encodes weight matrices with tensor-train factorization, enabling efficient, compact models for sequential tasks.

Findings

01

Reduces model size by several orders of magnitude.

02

Maintains or improves performance on classification and verification tasks.

03

Enhances inference speed and training stability.

Abstract

Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large RNNs in resource-limited settings, while also introducing complications in hyperparameter selection and training. To address these issues, we introduce a "fully tensorized" RNN architecture which jointly encodes the separate weight matrices within each recurrent cell using a lightweight tensor-train (TT) factorization. This approach represents a novel form of weight sharing which reduces model size by several orders of magnitude, while still maintaining similar or better performance compared to standard RNNs. Experiments on image classification and speaker verification tasks demonstrate further benefits for reducing inference times and stabilizing model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

onucharles/tensorized-rnn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Tensor decomposition and applications · Topic Modeling