S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition
Naichuan Zheng, Hailun Xia, Zepeng Sun, Weiyi Li, Yujia Wang

TL;DR
This paper introduces S3T-Former, a novel spike-driven Transformer architecture for skeleton action recognition that achieves high accuracy with reduced energy consumption on resource-limited devices.
Contribution
It presents the first purely spike-driven Transformer for skeleton action recognition, utilizing novel sparse event stream transformations and topology routing for energy efficiency.
Findings
Achieves state-of-the-art accuracy in energy-efficient neuromorphic action recognition.
Demonstrates significant energy savings over traditional ANN models.
Performs well on multiple large-scale datasets.
Abstract
Skeleton-based action recognition is crucial for multimedia applications but heavily relies on power-hungry Artificial Neural Networks (ANNs), limiting their deployment on resource-constrained edge devices. Spiking Neural Networks (SNNs) provide an energy-efficient alternative; however, existing spiking models for skeleton data often compromise the intrinsic sparsity of SNNs by resorting to dense matrix aggregations, heavy multimodal fusion modules, or non-sparse frequency domain transformations. Furthermore, they severely suffer from the short-term amnesia of spiking neurons. In this paper, we propose the Spiking State-Space Topology Transformer (S3T-Former), which, to the best of our knowledge, is the first purely spike-driven Transformer architecture specifically designed for energy-efficient skeleton action recognition. Rather than relying on heavy fusion overhead, we formulate a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing
