State-Space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era
Matteo Tiezzi, Michele Casoni, Alessandro Betti, Marco Gori, Stefano Melacci

TL;DR
This survey reviews recent advances in recurrent and state-space models for long sequence processing, highlighting their resurgence amid the dominance of Transformers and discussing future research directions.
Contribution
It provides a comprehensive taxonomy of recent recurrent-based architectures and algorithms, emphasizing new learning methods beyond traditional backpropagation for online sequence processing.
Findings
Recurrent models are experiencing a revival due to large-context Transformers.
Emerging approaches focus on online learning with local-forward computations.
There is significant potential for novel algorithms beyond backpropagation.
Abstract
Effectively learning from sequential data is a longstanding goal of Artificial Intelligence, especially in the case of long sequences. From the dawn of Machine Learning, several researchers have pursued algorithms and architectures capable of processing sequences of patterns, retaining information about past inputs while still leveraging future data, without losing precious long-term dependencies and correlations. While such an ultimate goal is inspired by the human hallmark of continuous real-time processing of sensory information, several solutions have simplified the learning paradigm by artificially limiting the processed context or dealing with sequences of limited length, given in advance. These solutions were further emphasized by the ubiquity of Transformers, which initially overshadowed the role of Recurrent Neural Nets. However, recurrent networks are currently experiencing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Blind Source Separation Techniques
