Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent   Neural Networks

Ralf C. Staudemeyer; Eric Rothstein Morris

arXiv:1909.09586·cs.NE·September 23, 2019·152 cites

Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent Neural Networks

Ralf C. Staudemeyer, Eric Rothstein Morris

PDF

Open Access 1 Repo

TL;DR

This paper provides a comprehensive tutorial on LSTM-RNNs, explaining their evolution, functioning, and improvements in documentation, aiming to clarify their impressive capabilities and correct previous inconsistencies.

Contribution

It offers an improved, unified explanation of LSTM-RNNs, correcting errors and enhancing understanding of their development and operation.

Findings

01

Clarified the evolution of LSTM-RNNs

02

Fixed errors in previous publications

03

Unified notation for better understanding

Abstract

Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN) are one of the most powerful dynamic classifiers publicly known. The network itself and the related learning algorithms are reasonably well documented to get an idea how it works. This paper will shed more light into understanding how LSTM-RNNs evolved and why they work impressively well, focusing on the early, ground-breaking publications. We significantly improved documentation and fixed a number of errors and inconsistencies that accumulated in previous publications. To support understanding we as well revised and unified the notation used.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kochlisGit/Stocks-Prediction
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Anomaly Detection Techniques and Applications · Time Series Analysis and Forecasting