LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural   Networks

Ruokai Yin; Youngeun Kim; Di Wu; Priyadarshini Panda

arXiv:2407.14073·cs.AR·September 4, 2024

LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks

Ruokai Yin, Youngeun Kim, Di Wu, Priyadarshini Panda

PDF

Open Access 1 Repo

TL;DR

This paper introduces LoAS, a novel accelerator with a fully temporal-parallel dataflow designed to efficiently process dual-sparse SNNs, significantly improving speed and energy efficiency over prior accelerators.

Contribution

The paper proposes a fully temporal-parallel dataflow and associated techniques for dual-sparse SNNs, enabling high efficiency and low latency in SNN acceleration.

Findings

01

Up to 8.51x speedup over prior accelerators

02

Up to 3.68x energy reduction

03

Effective compression and inner-join circuits for SNNs

Abstract

Spiking Neural Networks (SNNs) have gained significant research attention in the last decade due to their potential to drive resource-constrained edge devices. Though existing SNN accelerators offer high efficiency in processing sparse spikes with dense weights, opportunities are less explored in SNNs with sparse weights, i.e., dual-sparsity. In this work, we study the acceleration of dual-sparse SNNs, focusing on their core operation, sparse-matrix-sparse-matrix multiplication (spMspM). We observe that naively running a dual-sparse SNN on existing spMspM accelerators designed for dual-sparse Artificial Neural Networks (ANNs) exhibits sub-optimal efficiency. The main challenge is that processing timesteps, a natural property of SNNs, introduces an extra loop to ANN spMspM, leading to longer latency and more memory traffic. To address the problem, we propose a fully temporal-parallel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ruokaiyin/loas
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural Networks and Applications · Neural dynamics and brain function

MethodsSoftmax · Attention Is All You Need · Spiking Neural Networks