Systolic Array-based Accelerator for Structured State-Space Models
Shiva Raja, Cansu Demirkiran, Aakash Sarkar, Milos Popovic, Ajay Joshi

TL;DR
This paper presents EpochCore, a systolic array-based hardware accelerator optimized for State-Space Models, significantly improving the efficiency and performance of long-range sequence processing in AI applications.
Contribution
The paper introduces a novel systolic array architecture with a versatile processing element and a specialized dataflow to accelerate SSM inference efficiently.
Findings
2000x performance improvement over GPU on LRA datasets
250x performance gain over traditional SA accelerators
45x energy efficiency improvement
Abstract
Sequence modeling is crucial for AI to understand temporal data and detect complex time-dependent patterns. While recurrent neural networks (RNNs), convolutional neural networks (CNNs), and Transformers have advanced in capturing long-range dependencies, they struggle with achieving high accuracy with very long sequences due to limited memory retention (fixed context window). State-Space Models (SSMs) leverage exponentially decaying memory enabling lengthy context window and so they process very long data sequences more efficiently than recurrent and Transformer-based models. Unlike traditional neural models like CNNs and RNNs, SSM-based models require solving differential equations through continuous integration, making training and inference both compute- and memory-intensive on conventional CPUs and GPUs. In this paper we introduce a specialized hardware accelerator, EpochCore, for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
