TEFormer: Structured Bidirectional Temporal Enhancement Modeling in Spiking Transformers
Sicheng Shen, Mingyang Lv, Bing Han, Dongcheng Zhao, Guobin Shen, Feifei Zhao, Yi Zeng

TL;DR
TEFormer introduces a novel bidirectional temporal fusion mechanism in Spiking Transformers, significantly enhancing their ability to model spatiotemporal dependencies and outperform existing models across diverse benchmarks.
Contribution
It is the first framework to incorporate bidirectional temporal fusion in Spiking Transformers, combining parallel attention-based fusion with backward recurrent aggregation.
Findings
Consistently outperforms baseline SNN and Spiking Transformer models.
Performance remains stable across different neural encoding schemes.
Establishes TEFormer as a general framework for temporal modeling in SNNs.
Abstract
In recent years, Spiking Neural Networks (SNNs) have achieved remarkable progress, with Spiking Transformers emerging as a promising architecture for energy-efficient sequence modeling. However, existing Spiking Transformers still lack a principled mechanism for effective temporal fusion, limiting their ability to fully exploit spatiotemporal dependencies. Inspired by feedforward-feedback modulation in the human visual pathway, we propose TEFormer, the first Spiking Transformer framework that achieves bidirectional temporal fusion by decoupling temporal modeling across its core components. Specifically, TEFormer employs a lightweight and hyperparameter-free forward temporal fusion mechanism in the attention module, enabling fully parallel computation, while incorporating a backward gated recurrent structure in the MLP to aggregate temporal information in reverse order and reinforce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural dynamics and brain function
