A Study On the Effects of Pre-processing On Spatio-temporal Action   Recognition Using Spiking Neural Networks Trained with STDP

El-Assal Mireille; Tirilly Pierre; Bilasco Ioan Marius

arXiv:2105.14740·cs.CV·July 6, 2021

A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP

El-Assal Mireille, Tirilly Pierre, Bilasco Ioan Marius

PDF

TL;DR

This paper investigates how different pre-processing methods and temporal encoding techniques affect the ability of spiking neural networks trained with STDP to recognize actions in videos, aiming to improve their performance in spatio-temporal tasks.

Contribution

It introduces multiple methods for transposing temporal information into spike-based representations and evaluates their impact on SNN performance in action recognition tasks.

Findings

01

Certain pre-processing methods improve recognition accuracy

02

Similarity in action shape and speed influences recognition performance

03

Some encoding and fusion techniques outperform others

Abstract

There has been an increasing interest in spiking neural networks in recent years. SNNs are seen as hypothetical solutions for the bottlenecks of ANNs in pattern recognition, such as energy efficiency. But current methods such as ANN-to-SNN conversion and back-propagation do not take full advantage of these networks, and unsupervised methods have not yet reached a success comparable to advanced artificial neural networks. It is important to study the behavior of SNNs trained with unsupervised learning methods such as spike-timing dependent plasticity (STDP) on video classification tasks, including mechanisms to model motion information using spikes, as this information is critical for video understanding. This paper presents multiple methods of transposing temporal information into a static format, and then transforming the visual information into spikes using latency coding. These…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.