RotLSTM: Rotating Memories in Recurrent Neural Networks

Vlad Velici; Adam Pr\"ugel-Bennett

arXiv:2105.00357·cs.LG·May 4, 2021

RotLSTM: Rotating Memories in Recurrent Neural Networks

Vlad Velici, Adam Pr\"ugel-Bennett

PDF

Open Access

TL;DR

RotLSTM introduces trainable rotation matrices to modify LSTM cell states, significantly improving performance on certain tasks by enhancing long-term dependency modeling.

Contribution

The paper proposes a novel modification to LSTM units using rotation matrices, which is a new approach to enhance memory capabilities in recurrent neural networks.

Findings

01

Improved performance on bAbI dataset tasks.

02

Rotation matrices enhance long-term dependency learning.

03

Demonstrates the effectiveness of memory rotation in RNNs.

Abstract

Long Short-Term Memory (LSTM) units have the ability to memorise and use long-term dependencies between inputs to generate predictions on time series data. We introduce the concept of modifying the cell state (memory) of LSTMs using rotation matrices parametrised by a new set of trainable weights. This addition shows significant increases of performance on some of the tasks from the bAbI dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing · Model Reduction and Neural Networks