SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms
Xingrun Xing, Zheng Zhang, Ziyi Ni, Shitao Xiao, Yiming Ju, Siqi Fan,, Yequan Wang, Jiajun Zhang, Guoqi Li

TL;DR
SpikeLM introduces a novel elastic bi-spiking mechanism enabling fully spike-driven language models to handle both discriminative and generative tasks with higher accuracy, bridging the gap between SNNs and traditional neural networks.
Contribution
This work presents the first fully spike-driven language model with elastic bi-spiking, enhancing semantic encoding and generalization capabilities.
Findings
Achieves higher accuracy than previous SNN-based language models
Successfully handles both discriminative and generative language tasks
Bridges performance gap between SNNs and ANNs in language modeling
Abstract
Towards energy-efficient artificial intelligence similar to the human brain, the bio-inspired spiking neural networks (SNNs) have advantages of biological plausibility, event-driven sparsity, and binary activation. Recently, large-scale language models exhibit promising generalization capability, making it a valuable issue to explore more general spike-driven models. However, the binary spikes in existing SNNs fail to encode adequate semantic information, placing technological challenges for generalization. This work proposes the first fully spiking mechanism for general language tasks, including both discriminative and generative ones. Different from previous spikes with {0,1} levels, we propose a more general spike formulation with bi-directional, elastic amplitude, and elastic frequency encoding, while still maintaining the addition nature of SNNs. In a single time step, the spike is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
