# Slim LSTM networks: LSTM_6 and LSTM_C6

**Authors:** Atra Akandeh, Fathi M. Salem

arXiv: 1901.06401 · 2019-01-23

## TL;DR

This paper demonstrates that highly parameter-reduced LSTM variants, LSTM_6 and LSTM_C6, perform comparably to standard LSTM on multiple datasets, offering faster training suitable for resource-limited devices.

## Contribution

Introduces and evaluates two simplified LSTM variants, LSTM_6 and LSTM_C6, showing they maintain performance while reducing parameters and computational cost.

## Key findings

- LSTM_6 and LSTM_C6 are competitive with standard LSTM on IMDB and 20 Newsgroup datasets.
- Reduced-parameter LSTM variants enable faster training and inference.
- Proper hyper-parameter tuning is crucial for maintaining performance.

## Abstract

We have shown previously that our parameter-reduced variants of Long Short-Term Memory (LSTM) Recurrent Neural Networks (RNN) are comparable in performance to the standard LSTM RNN on the MNIST dataset. In this study, we show that this is also the case for two diverse benchmark datasets, namely, the review sentiment IMDB and the 20 Newsgroup datasets. Specifically, we focus on two of the simplest variants, namely LSTM_6 (i.e., standard LSTM with three constant fixed gates) and LSTM_C6 (i.e., LSTM_6 with further reduced cell body input block). We demonstrate that these two aggressively reduced-parameter variants are competitive with the standard LSTM when hyper-parameters, e.g., learning parameter, number of hidden units and gate constants are set properly. These architectures enable speeding up training computations and hence, these networks would be more suitable for online training and inference onto portable devices with relatively limited computational resources.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.06401/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/1901.06401/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/1901.06401/full.md

---
Source: https://tomesphere.com/paper/1901.06401