A Critical Review of Recurrent Neural Networks for Sequence Learning
Zachary C. Lipton, John Berkowitz, Charles Elkan

TL;DR
This paper reviews the development and advancements of recurrent neural networks (RNNs), especially LSTM and bidirectional architectures, highlighting their success in sequence learning tasks across various domains over the past three decades.
Contribution
It provides a comprehensive synthesis of RNN research history, clarifies conflicting notation, and explains recent breakthroughs making RNNs practically viable for large-scale sequence tasks.
Findings
RNNs effectively model sequential data across multiple domains.
Advances in architectures and training techniques have improved RNN performance.
LSTM and bidirectional RNNs have achieved state-of-the-art results in tasks like translation and recognition.
Abstract
Countless learning tasks require dealing with sequential data. Image captioning, speech synthesis, and music generation all require that a model produce outputs that are sequences. In other domains, such as time series prediction, video analysis, and musical information retrieval, a model must learn from inputs that are sequences. Interactive tasks, such as translating natural language, engaging in dialogue, and controlling a robot, often demand both capabilities. Recurrent neural networks (RNNs) are connectionist models that capture the dynamics of sequences via cycles in the network of nodes. Unlike standard feedforward neural networks, recurrent networks retain a state that can represent information from an arbitrarily long context window. Although recurrent neural networks have traditionally been difficult to train, and often contain millions of parameters, recent advances in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
