LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks
Ruokai Yin, Youngeun Kim, Di Wu, Priyadarshini Panda

TL;DR
This paper introduces LoAS, a novel accelerator with a fully temporal-parallel dataflow designed to efficiently process dual-sparse SNNs, significantly improving speed and energy efficiency over prior accelerators.
Contribution
The paper proposes a fully temporal-parallel dataflow and associated techniques for dual-sparse SNNs, enabling high efficiency and low latency in SNN acceleration.
Findings
Up to 8.51x speedup over prior accelerators
Up to 3.68x energy reduction
Effective compression and inner-join circuits for SNNs
Abstract
Spiking Neural Networks (SNNs) have gained significant research attention in the last decade due to their potential to drive resource-constrained edge devices. Though existing SNN accelerators offer high efficiency in processing sparse spikes with dense weights, opportunities are less explored in SNNs with sparse weights, i.e., dual-sparsity. In this work, we study the acceleration of dual-sparse SNNs, focusing on their core operation, sparse-matrix-sparse-matrix multiplication (spMspM). We observe that naively running a dual-sparse SNN on existing spMspM accelerators designed for dual-sparse Artificial Neural Networks (ANNs) exhibits sub-optimal efficiency. The main challenge is that processing timesteps, a natural property of SNNs, introduces an extra loop to ANN spMspM, leading to longer latency and more memory traffic. To address the problem, we propose a fully temporal-parallel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Neural dynamics and brain function
MethodsSoftmax · Attention Is All You Need · Spiking Neural Networks
