Iterative evaluation of LSTM cells

Leandro Palma; Luis Argerich

arXiv:1807.09830·cs.LG·July 27, 2018

Iterative evaluation of LSTM cells

Leandro Palma, Luis Argerich

PDF

Open Access

TL;DR

This paper introduces an iterative modification to LSTM cells that enhances their performance by repeating computations over fixed inputs and states, improving language modeling capabilities efficiently.

Contribution

The paper proposes a novel iterative scheme for LSTM cells, significantly boosting performance without increasing parameter count substantially.

Findings

01

Improved language modeling performance

02

Comparable results with more than three times the parameters

03

Theoretical and empirical validation of the iterative approach

Abstract

In this work we present a modification in the conventional flow of information through a LSTM network, which we consider well suited for RNNs in general. The modification leads to a iterative scheme where the computations performed by the LSTM cell are repeated over a constant input and cell state values, while updating the hidden state a finite number of times. We provide theoretical and empirical evidence to support the augmented capabilities of the iterative scheme and show examples related to language modeling. The modification yields an enhancement in the model performance comparable with the original model augmented more than 3 times in terms of the total amount of parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Machine Learning and ELM