Kirin: Improving ANN efficiency with SNN Hybridization
Chenyu Wang, Zhanglu Yan, Zhi Zhou, Xu Chen, Weng-Fai Wong

TL;DR
Kirin introduces a hybrid ANN-to-SNN conversion method that maintains accuracy while significantly reducing energy consumption and latency, leveraging spike matrix hybridization and a silence threshold mechanism.
Contribution
The paper presents a novel hybrid quantization and encoding approach for ANN-to-SNN conversion that achieves near-lossless accuracy with improved efficiency.
Findings
Achieves up to 84.66% energy reduction.
Reduces time steps by 93.75%.
Maintains near-FP16 accuracy under quantization.
Abstract
Artificial neural networks (ANNs), particularly large language models (LLMs), demonstrate powerful inference capabilities but consume substantial energy. Conversely, spiking neural networks (SNNs) exhibit exceptional energy efficiency due to their binary and event-driven characteristics, thus motivating the study of ANN-to-SNN conversion. In this process, quantization plays a pivotal role, mapping LLMs' floating-point parameters to discrete SNN parameters via the temporal dimension of the time window. However, several challenges remain in the conversion process: (i) converting high bit-width quantization values into binary spikes requires longer time windows, increasing system latency; and (ii) the inherent trade-off between the information loss of single-spike schemes and the energy costs of multi-spike ones in SNN. To address these challenges, we propose Kirin, a integer and spike…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Neural Networks and Reservoir Computing
