DISTA: Denoising Spiking Transformer with intrinsic plasticity and spatiotemporal attention
Boxun Xu, Hejia Geng, Yuxuan Yin, Peng Li

TL;DR
DISTA introduces a novel denoising spiking transformer that leverages intrinsic and spatiotemporal attention mechanisms, achieving state-of-the-art results in vision tasks with ultra-low latency and power efficiency.
Contribution
The paper proposes DISTA, a spiking transformer with intrinsic plasticity and spatiotemporal attention, enhancing neural computation and performance in vision applications.
Findings
Achieves 96.26% top-1 accuracy on CIFAR10 with 6 time steps.
Outperforms previous spiking transformers on neuromorphic datasets.
Uses joint training of synaptic and intrinsic plasticity for optimal performance.
Abstract
Among the array of neural network architectures, the Vision Transformer (ViT) stands out as a prominent choice, acclaimed for its exceptional expressiveness and consistent high performance in various vision applications. Recently, the emerging Spiking ViT approach has endeavored to harness spiking neurons, paving the way for a more brain-inspired transformer architecture that thrives in ultra-low power operations on dedicated neuromorphic hardware. Nevertheless, this approach remains confined to spatial self-attention and doesn't fully unlock the potential of spiking neural networks. We introduce DISTA, a Denoising Spiking Transformer with Intrinsic Plasticity and SpatioTemporal Attention, designed to maximize the spatiotemporal computational prowess of spiking neurons, particularly for vision applications. DISTA explores two types of spatiotemporal attentions: intrinsic neuron-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · CCD and CMOS Imaging Sensors
MethodsAttention Is All You Need · Adam · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings · Byte Pair Encoding · Dropout · Layer Normalization · Transformer
