DISTA: Denoising Spiking Transformer with intrinsic plasticity and   spatiotemporal attention

Boxun Xu; Hejia Geng; Yuxuan Yin; Peng Li

arXiv:2311.09376·cs.NE·November 17, 2023·1 cites

DISTA: Denoising Spiking Transformer with intrinsic plasticity and spatiotemporal attention

Boxun Xu, Hejia Geng, Yuxuan Yin, Peng Li

PDF

Open Access

TL;DR

DISTA introduces a novel denoising spiking transformer that leverages intrinsic and spatiotemporal attention mechanisms, achieving state-of-the-art results in vision tasks with ultra-low latency and power efficiency.

Contribution

The paper proposes DISTA, a spiking transformer with intrinsic plasticity and spatiotemporal attention, enhancing neural computation and performance in vision applications.

Findings

01

Achieves 96.26% top-1 accuracy on CIFAR10 with 6 time steps.

02

Outperforms previous spiking transformers on neuromorphic datasets.

03

Uses joint training of synaptic and intrinsic plasticity for optimal performance.

Abstract

Among the array of neural network architectures, the Vision Transformer (ViT) stands out as a prominent choice, acclaimed for its exceptional expressiveness and consistent high performance in various vision applications. Recently, the emerging Spiking ViT approach has endeavored to harness spiking neurons, paving the way for a more brain-inspired transformer architecture that thrives in ultra-low power operations on dedicated neuromorphic hardware. Nevertheless, this approach remains confined to spatial self-attention and doesn't fully unlock the potential of spiking neural networks. We introduce DISTA, a Denoising Spiking Transformer with Intrinsic Plasticity and SpatioTemporal Attention, designed to maximize the spatiotemporal computational prowess of spiking neurons, particularly for vision applications. DISTA explores two types of spatiotemporal attentions: intrinsic neuron-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · CCD and CMOS Imaging Sensors

MethodsAttention Is All You Need · Adam · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings · Byte Pair Encoding · Dropout · Layer Normalization · Transformer