SpikeX: Exploring Accelerator Architecture and Network-Hardware Co-Optimization for Sparse Spiking Neural Networks
Boxun Xu, Richard Boone, Peng Li

TL;DR
SpikeX introduces a novel systolic-array accelerator for sparse spiking neural networks, optimizing hardware and network co-design to significantly enhance energy efficiency and reduce inference latency.
Contribution
The paper presents SpikeX, a new accelerator architecture and co-optimization methodology for SNNs, addressing unstructured sparsity and enabling joint hardware-aware training and architecture search.
Findings
Achieves 15.1x-150.87x reduction in energy-delay-product.
Effectively exploits unstructured sparsity for hardware efficiency.
Maintains model accuracy while improving energy and latency metrics.
Abstract
Spiking Neural Networks (SNNs) are promising biologically plausible models of computation which utilize a spiking binary activation function similar to that of biological neurons. SNNs are well positioned to process spatiotemporal data, and are advantageous in ultra-low power and real-time processing. Despite a large body of work on conventional artificial neural network accelerators, much less attention has been given to efficient SNN hardware accelerator design. In particular, SNNs exhibit inherent unstructured spatial and temporal firing sparsity, an opportunity yet to be fully explored for great hardware processing efficiency. In this work, we propose a novel systolic-array SNN accelerator architecture, called SpikeX, to take on the challenges and opportunities stemming from unstructured sparsity while taking into account the unique characteristics of spike-based computation. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
MethodsSoftmax · Attention Is All You Need · Spiking Neural Networks
