Phi: Leveraging Pattern-based Hierarchical Sparsity for High-Efficiency Spiking Neural Networks
Chiyue Wei, Bowen Duan, Cong Guo, Jingyang Zhang, Qingyue Song, Hai "Helen" Li, Yiran Chen

TL;DR
This paper introduces Phi, a pattern-based hierarchical sparsity framework for Spiking Neural Networks that significantly reduces computation and energy consumption by exploiting activation patterns through a combined algorithm-hardware approach.
Contribution
The paper proposes a novel two-level sparsity hierarchy and a dedicated hardware architecture, improving SNN efficiency by leveraging pattern-based activation sparsity.
Findings
Achieves 3.45x speedup over state-of-the-art SNN accelerators.
Provides 4.93x energy efficiency improvement.
Demonstrates effective pattern-based sparsity exploitation in SNNs.
Abstract
Spiking Neural Networks (SNNs) are gaining attention for their energy efficiency and biological plausibility, utilizing 0-1 activation sparsity through spike-driven computation. While existing SNN accelerators exploit this sparsity to skip zero computations, they often overlook the unique distribution patterns inherent in binary activations. In this work, we observe that particular patterns exist in spike activations, which we can utilize to reduce the substantial computation of SNN models. Based on these findings, we propose a novel \textbf{pattern-based hierarchical sparsity} framework, termed \textbf{\textit{Phi}}, to optimize computation. \textit{Phi} introduces a two-level sparsity hierarchy: Level 1 exhibits vector-wise sparsity by representing activations with pre-defined patterns, allowing for offline pre-computation with weights and significantly reducing most runtime…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
MethodsSoftmax · Attention Is All You Need · Spiking Neural Networks
