Toward Efficient Spiking Transformers: Synapse Pruning Meets Synergistic Learning-Based Compensation
Hongze Sun, Wuque Cai, Duo Chen, Quan Tang, Shifeng Mao, Jiayi He, Zhenxing Wang, Yan Cui, Dezhong Yao, Daqing Guo

TL;DR
This paper introduces a combined approach of synapse pruning and a novel neuron model to create lightweight, efficient spiking Transformer models that maintain high performance with reduced computational costs.
Contribution
It proposes tailored pruning strategies and a new neuron model to significantly reduce model size and complexity while preserving performance in spiking Transformers.
Findings
Model size and computational costs are substantially reduced.
Performance remains competitive despite pruning.
Pruning and compensation strategies are effective in efficient model design.
Abstract
As a foundational architecture of artificial intelligence models, Transformer has been recently adapted to spiking neural networks with promising performance across various tasks. However, existing spiking Transformer(ST)-based models require a substantial number of parameters and incur high computational costs, thus limiting their deployment in resource-constrained environments. To address these challenges, we propose combining synapse pruning with a synergistic learning-based compensation strategy to derive lightweight ST-based models. Specifically, two types of tailored pruning strategies are introduced to reduce redundancy in the weight matrices of ST blocks: an unstructured method to induce sparse representations, and a structured DSP method to induce low-rank representations. In addition, we propose an enhanced spiking neuron model, termed the synergistic leaky…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Reservoir Computing · Ferroelectric and Negative Capacitance Devices
