The Hardware Impact of Quantization and Pruning for Weights in Spiking Neural Networks
Clemens JS Schaefer, Pooria Taheri, Mark Horeni, and Siddharth Joshi

TL;DR
This paper investigates how quantization and pruning techniques affect the performance and energy efficiency of spiking neural networks on hardware, revealing that aggressive quantization is highly effective while pruning has limitations.
Contribution
It provides a comprehensive analysis of the combined effects of quantization and pruning on SNNs, highlighting their impact on accuracy and energy consumption on specialized hardware.
Findings
Quantization down to ternary weights does not reduce accuracy.
Pruning maintains accuracy up to 80% sparsity but increases energy consumption.
Combining pruning and quantization offers a trade-off between energy efficiency and accuracy.
Abstract
Energy efficient implementations and deployments of Spiking neural networks (SNNs) have been of great interest due to the possibility of developing artificial systems that can achieve the computational powers and energy efficiency of the biological brain. Efficient implementations of SNNs on modern digital hardware are also inspired by advances in machine learning and deep neural networks (DNNs). Two techniques widely employed in the efficient deployment of DNNs -- the quantization and pruning of parameters, can both compress the model size, reduce memory footprints, and facilitate low-latency execution. The interaction between quantization and pruning and how they might impact model performance on SNN accelerators is currently unknown. We study various combinations of pruning and quantization in isolation, cumulatively, and simultaneously (jointly) to a state-of-the-art SNN targeting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · CCD and CMOS Imaging Sensors
MethodsPruning
