Towards High-performance Spiking Transformers from ANN to SNN Conversion
Zihan Huang, Xinyu Shi, Zecheng Hao, Tong Bu, Jianhao Ding, Zhaofei, Yu, Tiejun Huang

TL;DR
This paper introduces a novel conversion method for transforming Transformer neural networks into spiking neural networks, achieving high accuracy with low latency and power consumption, thus advancing energy-efficient AI models.
Contribution
It presents the first successful ANN to SNN conversion for Spiking Transformers, incorporating an Expectation Compensation Module and Multi-Threshold Neuron to enhance performance.
Findings
Achieves 88.60% top-1 accuracy with 4 time steps
Reduces power consumption to 35% of original Transformer
Maintains high accuracy with only 1% loss
Abstract
Spiking neural networks (SNNs) show great potential due to their energy efficiency, fast processing capabilities, and robustness. There are two main approaches to constructing SNNs. Direct training methods require much memory, while conversion methods offer a simpler and more efficient option. However, current conversion methods mainly focus on converting convolutional neural networks (CNNs) to SNNs. Converting Transformers to SNN is challenging because of the presence of non-linear modules. In this paper, we propose an Expectation Compensation Module to preserve the accuracy of the conversion. The core idea is to use information from the previous T time-steps to calculate the expected output at time-step T. We also propose a Multi-Threshold Neuron and the corresponding Parallel Parameter normalization to address the challenge of large time steps needed for high accuracy, aiming to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
