FAS: Fast ANN-SNN Conversion for Spiking Large Language Models

Long Chen; Xiaotian Song; Andy Song; BaDong Chen; Jiancheng Lv; Yanan Sun

arXiv:2502.04405·cs.LG·May 15, 2025

FAS: Fast ANN-SNN Conversion for Spiking Large Language Models

Long Chen, Xiaotian Song, Andy Song, BaDong Chen, Jiancheng Lv, Yanan Sun

PDF

Open Access 1 Repo

TL;DR

This paper introduces FAS, a two-stage conversion method that efficiently transforms large language models into spiking models, achieving high accuracy with significantly reduced latency and energy consumption.

Contribution

FAS is a novel two-stage ANN-SNN conversion strategy that enhances performance and reduces computational costs for spiking large language models.

Findings

01

Achieves 3% higher accuracy than OPT-7B with only 8 timesteps

02

Reduces energy consumption by 96.63%

03

Outperforms existing methods on language and vision-language tasks

Abstract

Spiking Large Language Models have been shown as a good alternative to LLMs in various scenarios. Existing methods for creating Spiking LLMs, i.e., direct training and ANN-SNN conversion, often suffer from performance degradation and relatively high computational costs. To address these issues, we propose a novel Fast ANN-SNN conversion strategy (FAS) that transforms LLMs into spiking LLMs in two stages. The first stage employs a full-parameter fine-tuning of pre-trained models, so it does not need any direct training from scratch. The second stage introduces a coarse-to-fine calibration method to reduce conversion errors and improve accuracy. Experiments on both language and vision-language tasks across four different scales of LLMs demonstrate that FAS can achieve state-of-the-art performance yet with significantly reduced inference latency and computational costs. Notably, FAS only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lc783/fas
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Robotics and Automated Systems