Workload-Balanced Pruning for Sparse Spiking Neural Networks

Ruokai Yin; Youngeun Kim; Yuhang Li; Abhishek Moitra; Nitin Satpute,; Anna Hambitzer; Priyadarshini Panda

arXiv:2302.06746·cs.NE·March 26, 2024

Workload-Balanced Pruning for Sparse Spiking Neural Networks

Ruokai Yin, Youngeun Kim, Yuhang Li, Abhishek Moitra, Nitin Satpute,, Anna Hambitzer, Priyadarshini Panda

PDF

Open Access

TL;DR

This paper introduces u-Ticket, a workload-aware pruning method for sparse SNNs that ensures optimal hardware utilization, significantly reducing latency and energy consumption on resource-constrained devices.

Contribution

The paper proposes u-Ticket, a novel pruning approach that monitors and adjusts weights during LTH-based pruning to achieve perfect hardware utilization in sparse SNNs.

Findings

01

u-Ticket guarantees up to 100% hardware utilization.

02

Reduces latency by up to 76.9%.

03

Decreases energy cost by up to 63.8%.

Abstract

Pruning for Spiking Neural Networks (SNNs) has emerged as a fundamental methodology for deploying deep SNNs on resource-constrained edge devices. Though the existing pruning methods can provide extremely high weight sparsity for deep SNNs, the high weight sparsity brings a workload imbalance problem. Specifically, the workload imbalance happens when a different number of non-zero weights are assigned to hardware units running in parallel. This results in low hardware utilization and thus imposes longer latency and higher energy costs. In preliminary experiments, we show that sparse SNNs (~98% weight sparsity) can suffer as low as ~59% utilization. To alleviate the workload imbalance problem, we propose u-Ticket, where we monitor and adjust the weight connections of the SNN during Lottery Ticket Hypothesis (LTH) based pruning, thus guaranteeing the final ticket gets optimal utilization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural dynamics and brain function

MethodsPruning