Maelstrom Networks
Matthew Evanusa, Cornelia Ferm\"uller, Yiannis Aloimonos

TL;DR
Maelstrom Networks introduce a novel neural network paradigm combining recurrent and feed-forward components, enabling efficient online processing, dynamic working memory, and potential for neuromorphic hardware implementation, advancing towards artificial networks with a sense of "self."
Contribution
The paper proposes Maelstrom Networks, a new paradigm that separates recurrent dynamics from learning, enhancing online processing and memory in neural networks, and paving the way for neuromorphic and continual learning.
Findings
Enables online sequence processing without unrolling the network.
Provides a dynamic working memory through recurrent components.
Facilitates implementation in neuromorphic hardware.
Abstract
Artificial Neural Networks has struggled to devise a way to incorporate working memory into neural networks. While the ``long term'' memory can be seen as the learned weights, the working memory consists likely more of dynamical activity, that is missing from feed-forward models. Current state of the art models such as transformers tend to ``solve'' this by ignoring working memory entirely and simply process the sequence as an entire piece of data; however this means the network cannot process the sequence in an online fashion, and leads to an immense explosion in memory requirements. Here, inspired by a combination of controls, reservoir computing, deep learning, and recurrent neural networks, we offer an alternative paradigm that combines the strength of recurrent networks, with the pattern matching capability of feed-forward neural networks, which we call the \textit{Maelstrom…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Reservoir Computing · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing
