A recurrent neural network without chaos
Thomas Laurent, James von Brecht

TL;DR
This paper presents a simple gated RNN that matches the performance of complex models like LSTMs on language modeling tasks, with predictable and non-chaotic dynamics, unlike traditional gated architectures.
Contribution
Introduces a straightforward gated RNN with comparable performance to LSTMs and GRUs, demonstrating non-chaotic, predictable dynamics.
Findings
Achieves similar performance to LSTMs and GRUs on language modeling.
Exhibits simple, predictable, and non-chaotic dynamics.
Contrasts with standard gated architectures that show chaotic behavior.
Abstract
We introduce an exceptionally simple gated recurrent neural network (RNN) that achieves performance comparable to well-known gated architectures, such as LSTMs and GRUs, on the word-level language modeling task. We prove that our model has simple, predicable and non-chaotic dynamics. This stands in stark contrast to more standard gated architectures, whose underlying dynamical systems exhibit chaotic behavior.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Neural Networks and Applications · Natural Language Processing Techniques
