Toward Efficient Spiking Transformers: Synapse Pruning Meets Synergistic Learning-Based Compensation

Hongze Sun; Wuque Cai; Duo Chen; Quan Tang; Shifeng Mao; Jiayi He; Zhenxing Wang; Yan Cui; Dezhong Yao; Daqing Guo

arXiv:2508.01992·cs.LG·January 6, 2026

Toward Efficient Spiking Transformers: Synapse Pruning Meets Synergistic Learning-Based Compensation

Hongze Sun, Wuque Cai, Duo Chen, Quan Tang, Shifeng Mao, Jiayi He, Zhenxing Wang, Yan Cui, Dezhong Yao, Daqing Guo

PDF

Open Access

TL;DR

This paper introduces a combined approach of synapse pruning and a novel neuron model to create lightweight, efficient spiking Transformer models that maintain high performance with reduced computational costs.

Contribution

It proposes tailored pruning strategies and a new neuron model to significantly reduce model size and complexity while preserving performance in spiking Transformers.

Findings

01

Model size and computational costs are substantially reduced.

02

Performance remains competitive despite pruning.

03

Pruning and compensation strategies are effective in efficient model design.

Abstract

As a foundational architecture of artificial intelligence models, Transformer has been recently adapted to spiking neural networks with promising performance across various tasks. However, existing spiking Transformer(ST)-based models require a substantial number of parameters and incur high computational costs, thus limiting their deployment in resource-constrained environments. To address these challenges, we propose combining synapse pruning with a synergistic learning-based compensation strategy to derive lightweight ST-based models. Specifically, two types of tailored pruning strategies are introduced to reduce redundancy in the weight matrices of ST blocks: an unstructured $L_{1} P$ method to induce sparse representations, and a structured DSP method to induce low-rank representations. In addition, we propose an enhanced spiking neuron model, termed the synergistic leaky…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural Networks and Reservoir Computing · Ferroelectric and Negative Capacitance Devices