Attention via Synaptic Plasticity is All You Need: A Biologically Inspired Spiking Neuromorphic Transformer
Kallol Mondal (1, 2), Ankush Kumar (2) ((1) Department of Electronics, Communication Engineering, National Institute of Technology Allahabad, Prayagraj, (2) Centre for Nanotechnology, Indian Institute of Technology Roorkee)

TL;DR
This paper introduces a biologically inspired spiking Transformer that uses synaptic plasticity for attention, achieving high accuracy and energy efficiency on image classification tasks, and aligning more closely with brain mechanisms.
Contribution
The paper presents the Spiking STDP Transformer (S$^{2}$TDPT), a novel neuromorphic attention mechanism based on spike-timing-dependent plasticity, enabling in-memory computing and energy-efficient inference.
Findings
Achieves 94.35% accuracy on CIFAR-10 with 4 timesteps.
Reduces energy consumption by 88.47% compared to standard Transformers.
Provides interpretable attention maps highlighting relevant regions.
Abstract
Attention is the brain's ability to selectively focus on a few specific aspects while ignoring irrelevant ones. This biological principle inspired the attention mechanism in modern Transformers. Transformers now underpin large language models (LLMs) such as GPT, but at the cost of massive training and inference energy, leading to a large carbon footprint. While brain attention emerges from neural circuits, Transformer attention relies on dot-product similarity to weight elements in the input sequence. Neuromorphic computing, especially spiking neural networks (SNNs), offers a brain-inspired path to energy-efficient intelligence. Despite recent work on attention-based spiking Transformers, the core attention layer remains non-neuromorphic. Current spiking attention (i) relies on dot-product or element-wise similarity suited to floating-point operations, not event-driven spikes; (ii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
