# IndyLSTMs: Independently Recurrent LSTMs

**Authors:** Pedro Gonnet, Thomas Deselaers

arXiv: 1903.08023 · 2019-03-20

## TL;DR

IndyLSTMs are a new type of recurrent neural network cell with diagonal recurrent weights, leading to smaller, faster models that outperform traditional LSTMs in accuracy and overfitting resistance across multiple handwriting datasets.

## Contribution

This paper introduces IndyLSTMs, a novel recurrent cell with diagonal recurrent weights, reducing parameters and FLOPS while improving performance over standard LSTMs.

## Key findings

- IndyLSTMs outperform regular LSTMs in accuracy per parameter.
- IndyLSTMs are less prone to overfitting.
- Models with IndyLSTMs are smaller and faster.

## Abstract

We introduce Independently Recurrent Long Short-term Memory cells: IndyLSTMs. These differ from regular LSTM cells in that the recurrent weights are not modeled as a full matrix, but as a diagonal matrix, i.e.\ the output and state of each LSTM cell depends on the inputs and its own output/state, as opposed to the input and the outputs/states of all the cells in the layer. The number of parameters per IndyLSTM layer, and thus the number of FLOPS per evaluation, is linear in the number of nodes in the layer, as opposed to quadratic for regular LSTM layers, resulting in potentially both smaller and faster models. We evaluate their performance experimentally by training several models on the popular \iamondb and CASIA online handwriting datasets, as well as on several of our in-house datasets. We show that IndyLSTMs, despite their smaller size, consistently outperform regular LSTMs both in terms of accuracy per parameter, and in best accuracy overall. We attribute this improved performance to the IndyLSTMs being less prone to overfitting.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.08023/full.md

## Figures

23 figures with captions in the complete paper: https://tomesphere.com/paper/1903.08023/full.md

## References

24 references — full list in the complete paper: https://tomesphere.com/paper/1903.08023/full.md

---
Source: https://tomesphere.com/paper/1903.08023