LAS: Loss-less ANN-SNN Conversion for Fully Spike-Driven Large Language Models

Long Chen; Xiaotian Song; Yanan Sun

arXiv:2505.09659·cs.LG·May 16, 2025

LAS: Loss-less ANN-SNN Conversion for Fully Spike-Driven Large Language Models

Long Chen, Xiaotian Song, Yanan Sun

PDF

Open Access 1 Repo 1 Video

TL;DR

LAS introduces a novel loss-less conversion method for fully spike-driven LLMs, effectively handling activation outliers and nonlinearities, resulting in improved accuracy and energy efficiency in spiking models.

Contribution

LAS proposes two new neurons and tailored transformer components to enable loss-less ANN-to-SNN conversion for large language models.

Findings

01

Achieves loss-less conversion on multiple models

02

Improves accuracy by 2% on OPT-66B WSC task

03

Verifies effectiveness through parameter and ablation studies

Abstract

Spiking Large Language Models (LLMs) have emerged as an energy-efficient alternative to conventional LLMs through their event-driven computation. To effectively obtain spiking LLMs, researchers develop different ANN-to-SNN conversion methods by leveraging pre-trained ANN parameters while inheriting the energy efficiency of SNN. However, existing conversion methods struggle with extreme activation outliers and incompatible nonlinear operations of ANN-based LLMs. To address this, we propose a loss-less ANN-SNN conversion for fully spike-driven LLMs, termed LAS. Specifically, LAS introduces two novel neurons to convert the activation outlier and nonlinear operation of ANN-based LLMs. Moreover, LAS tailors the spike-equivalent Transformer components for spiking LLMs, which can ensure full spiking conversion without any loss of performance. Experimental results on six language models and two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lc783/las
pytorchOfficial

Videos

LAS: Loss-less ANN-SNN Conversion for Fully Spike-Driven Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Dropout · Layer Normalization · Byte Pair Encoding · Softmax · Absolute Position Encodings · Residual Connection