Spiking Layer-Adaptive Magnitude-based Pruning
Junqiao Wang, Zhehang Ye, Yuqi Ouyang

TL;DR
This paper introduces SLAMP, a novel pruning framework for SNNs that optimally reduces connectivity and operations by considering temporal dynamics, leading to energy-efficient and accurate neural network deployment.
Contribution
SLAMP generalizes layer-adaptive magnitude pruning to temporal SNNs through a theory-guided, distortion-constrained optimization approach, improving pruning effectiveness.
Findings
Significant reduction in connectivity and spiking operations.
Maintains high accuracy on multiple datasets.
Enables efficient deployment of SNNs.
Abstract
Spiking Neural Networks (SNNs) provide energy-efficient computation but their deployment is constrained by dense connectivity and high spiking operation costs. Existing magnitude-based pruning strategies, when naively applied to SNNs, fail to account for temporal accumulation, non-uniform timestep contributions, and membrane stability, often leading to severe performance degradation. This paper proposes Spiking Layer-Adaptive Magnitude-based Pruning (SLAMP), a theory-guided pruning framework that generalizes layer-adaptive magnitude pruning to temporal SNNs by explicitly controlling worst-case output distortion across layers and timesteps. SLAMP formulates sparsity allocation as a temporal distortion-constrained optimization problem, yielding time-aware layer importance scores that reduce to conventional layer-adaptive pruning in single-timestep limit. An efficient two-stage procedure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Reservoir Computing · Ferroelectric and Negative Capacitance Devices
