Simplified Long Short-term Memory Recurrent Neural Networks: part III

Atra Akandeh; Fathi M. Salem

arXiv:1707.04626·cs.NE·July 18, 2017·6 cites

Simplified Long Short-term Memory Recurrent Neural Networks: part III

Atra Akandeh, Fathi M. Salem

PDF

Open Access

TL;DR

This paper introduces two new simplified LSTM variants that significantly reduce computational complexity while maintaining performance, addressing the need for efficient processing of large time-sequence data in resource-constrained environments.

Contribution

It presents and evaluates two novel simplified LSTM models with pointwise state multiplications, enhancing computational efficiency without sacrificing accuracy.

Findings

01

New LSTM variants reduce computational load

02

Performance comparable to standard LSTM models

03

Effective for processing large time-sequence data

Abstract

This is part III of three-part work. In parts I and II, we have presented eight variants for simplified Long Short Term Memory (LSTM) recurrent neural networks (RNNs). It is noted that fast computation, specially in constrained computing resources, are an important factor in processing big time-sequence data. In this part III paper, we present and evaluate two new LSTM model variants which dramatically reduce the computational load while retaining comparable performance to the base (standard) LSTM RNNs. In these new variants, we impose (Hadamard) pointwise state multiplications in the cell-memory network in addition to the gating signal networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing · Advanced Memory and Neural Computing

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory