Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model

Kaiwen Tang; Zhanglu Yan; Weng-Fai Wong

arXiv:2409.15298·cs.NE·January 5, 2026

Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model

Kaiwen Tang, Zhanglu Yan, Weng-Fai Wong

PDF

Open Access 1 Video

TL;DR

Sorbet is a neuromorphic hardware-compatible transformer-based spiking language model that uses novel energy-efficient operations, achieving significant energy savings while maintaining competitive performance on language understanding benchmarks.

Contribution

Introduces Sorbet, a transformer-based spiking language model with novel softmax and normalization techniques optimized for neuromorphic hardware.

Findings

01

Achieves 27.16× energy savings over BERT.

02

Maintains competitive performance on GLUE benchmark.

03

Uses knowledge distillation and quantization for model compression.

Abstract

For reasons such as privacy, there are use cases for language models at the edge. This has given rise to small language models targeted for deployment in resource-constrained devices where energy efficiency is critical. Spiking neural networks (SNNs) offer a promising solution due to their energy efficiency, and there are already works on realizing transformer-based models on SNNs. However, key operations like softmax and layer normalization (LN) are difficult to implement on neuromorphic hardware, and many of these early works sidestepped them. To address these challenges, we introduce Sorbet, a transformer-based spiking language model that is more neuromorphic hardware-compatible. Sorbet incorporates a novel shifting-based softmax called PTsoftmax and a Bit Shifting PowerNorm (BSPN), both designed to replace the respective energy-intensive operations. By leveraging knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Sorbet: A Neuromorphic Hardware-Compatible Transformer-Based Spiking Language Model· slideslive

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Modular Robots and Swarm Intelligence

MethodsSoftmax · Layer Normalization · Knowledge Distillation