Modern Neuromorphic AI: From Intra-Token to Inter-Token Processing
Osvaldo Simeone

TL;DR
This paper explores how neuromorphic principles are integrated into modern AI architectures, focusing on intra-token and inter-token processing, and reviews training methods for energy-efficient, brain-inspired models.
Contribution
It systematically connects neuromorphic models with state-space and transformer architectures, highlighting recent advances in inter-token processing techniques.
Findings
Modern neuromorphic AI incorporates quantized activations and sparse attention.
Recent approaches leverage state-space dynamics for efficient inter-token processing.
Training methods include surrogate gradients and reinforcement learning.
Abstract
The rapid growth of artificial intelligence (AI) has brought novel data processing and generative capabilities but also escalating energy requirements. This challenge motivates renewed interest in neuromorphic computing principles, which promise brain-like efficiency through discrete and sparse activations, recurrent dynamics, and non-linear feedback. In fact, modern AI architectures increasingly embody neuromorphic principles through heavily quantized activations, state-space dynamics, and sparse attention mechanisms. This paper elaborates on the connections between neuromorphic models, state-space models, and transformer architectures through the lens of the distinction between intra-token processing and inter-token processing. Most early work on neuromorphic AI was based on spiking neural networks (SNNs) for intra-token processing, i.e., for transformations involving multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
