Spiking Neural Networks with Dynamic Time Steps for Vision Transformers
Gourav Datta, Zeyu Liu, Anni Li, Peter A. Beerel

TL;DR
This paper introduces a novel training framework for Spiking Neural Networks with dynamic time steps tailored for Vision Transformers, significantly improving energy efficiency and maintaining high accuracy on image recognition benchmarks.
Contribution
It proposes a trainable, dynamic allocation of time steps per ViT module, enabling SNNs to operate efficiently with fewer time steps and high activation sparsity.
Findings
Achieved 95.97% accuracy on CIFAR10 with 4.97 time steps.
Reduced energy consumption by using only accumulate operations.
Demonstrated effectiveness on CIFAR10, CIFAR100, and ImageNet datasets.
Abstract
Spiking Neural Networks (SNNs) have emerged as a popular spatio-temporal computing paradigm for complex vision tasks. Recently proposed SNN training algorithms have significantly reduced the number of time steps (down to 1) for improved latency and energy efficiency, however, they target only convolutional neural networks (CNN). These algorithms, when applied on the recently spotlighted vision transformers (ViT), either require a large number of time steps or fail to converge. Based on analysis of the histograms of the ANN and SNN activation maps, we hypothesize that each ViT block has a different sensitivity to the number of time steps. We propose a novel training framework that dynamically allocates the number of time steps to each ViT module depending on a trainable score assigned to each timestep. In particular, we generate a scalar binary time step mask that filters spikes emitted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · Photoreceptor and optogenetics research
MethodsSpiking Neural Networks
