SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition
Jiaqi Chen, Yan Yang, Shizhuo Deng, Da Teng, and Liyuan Pan

TL;DR
SpikMamba introduces a novel framework combining spiking neural networks and Mamba for efficient event-based human action recognition, achieving state-of-the-art accuracy on multiple datasets.
Contribution
The paper proposes SpikMamba, a new approach that integrates energy-efficient spiking neural networks with Mamba for improved event-based HAR.
Findings
Surpasses previous state-of-the-art accuracy on PAF, HARDVS, DVS128, and E-FAction datasets.
Uses spiking window-based linear attention for better locality modeling.
Demonstrates high recognition performance with efficient processing of sparse event data.
Abstract
Human action recognition (HAR) plays a key role in various applications such as video analysis, surveillance, autonomous driving, robotics, and healthcare. Most HAR algorithms are developed from RGB images, which capture detailed visual information. However, these algorithms raise concerns in privacy-sensitive environments due to the recording of identifiable features. Event cameras offer a promising solution by capturing scene brightness changes sparsely at the pixel level, without capturing full images. Moreover, event cameras have high dynamic ranges that can effectively handle scenarios with complex lighting conditions, such as low light or high contrast environments. However, using event cameras introduces challenges in modeling the spatially sparse and high temporal resolution event data for HAR. To address these issues, we propose the SpikMamba framework, which combines the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Multimodal Machine Learning Applications
MethodsSoftmax · Attention Is All You Need · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
