A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP
El-Assal Mireille, Tirilly Pierre, Bilasco Ioan Marius

TL;DR
This paper investigates how different pre-processing methods and temporal encoding techniques affect the ability of spiking neural networks trained with STDP to recognize actions in videos, aiming to improve their performance in spatio-temporal tasks.
Contribution
It introduces multiple methods for transposing temporal information into spike-based representations and evaluates their impact on SNN performance in action recognition tasks.
Findings
Certain pre-processing methods improve recognition accuracy
Similarity in action shape and speed influences recognition performance
Some encoding and fusion techniques outperform others
Abstract
There has been an increasing interest in spiking neural networks in recent years. SNNs are seen as hypothetical solutions for the bottlenecks of ANNs in pattern recognition, such as energy efficiency. But current methods such as ANN-to-SNN conversion and back-propagation do not take full advantage of these networks, and unsupervised methods have not yet reached a success comparable to advanced artificial neural networks. It is important to study the behavior of SNNs trained with unsupervised learning methods such as spike-timing dependent plasticity (STDP) on video classification tasks, including mechanisms to model motion information using spikes, as this information is critical for video understanding. This paper presents multiple methods of transposing temporal information into a static format, and then transforming the visual information into spikes using latency coding. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
