Toward Efficient Deep Spiking Neuron Networks:A Survey On Compression
Hui Xie, Ge Yang, Wenjuan Gao

TL;DR
This survey reviews methods for compressing and improving the efficiency of Deep Spiking Neural Networks, emphasizing techniques like pruning, quantization, and spike reduction to enable practical deployment.
Contribution
It provides a comprehensive overview of existing compression techniques for DSNNs, addressing a research gap in their efficiency and deployment.
Findings
Pruning and quantization effectively reduce DSNN model size.
Spike firing reduction decreases energy consumption.
Future research should focus on hardware-aware compression methods.
Abstract
With the rapid development of deep learning, Deep Spiking Neural Networks (DSNNs) have emerged as promising due to their unique spike event processing and asynchronous computation. When deployed on neuromorphic chips, DSNNs offer significant power advantages over Deep Artificial Neural Networks (DANNs) and eliminate time and energy consuming multiplications due to the binary nature of spikes (0 or 1). Additionally, DSNNs excel in processing temporal information, making them potentially superior for handling temporal data compared to DANNs. However, their deep network structure and numerous parameters result in high computational costs and energy consumption, limiting real-life deployment. To enhance DSNNs efficiency, researchers have adapted methods from DANNs, such as pruning, quantization, and knowledge distillation, and developed specific techniques like reducing spike firing and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Ferroelectric and Negative Capacitance Devices
MethodsPruning
