An Asynchronous Multi-core Accelerator for SNN inference
Zhuo Chen, De Ma, Xiaofei Jin, Qinghui Xing, Ouwen Jin, Xin Du,, Shuibing He, Gang Pan

TL;DR
This paper introduces an asynchronous multi-core accelerator for SNN inference that removes the need for synchronization, resulting in significant improvements in speed and energy efficiency.
Contribution
It presents a novel asynchronous architecture for SNNs that eliminates inter-core synchronization by using dependency-aware scheduling, enhancing performance and energy efficiency.
Findings
Achieves 1.86x speedup over synchronized architectures.
Attains 1.55x higher energy efficiency.
Effective across multiple SNN workloads.
Abstract
Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy of SNNs often necessitates frequent explicit synchronization among all cores, which presents a challenge to overall efficiency. In this paper, we propose an asynchronous architecture for Spiking Neural Networks (SNNs) that eliminates the need for inter-core synchronization, thus enhancing speed and energy efficiency. This approach leverages the pre-determined dependencies of neuromorphic cores established during compilation. Each core is equipped with a scheduler that monitors the status of its dependencies, allowing it to safely advance to the next timestep without waiting for other cores. This eliminates the necessity for global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum-Dot Cellular Automata · Brain Tumor Detection and Classification · Machine Learning and ELM
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Spiking Neural Networks
