Spatiotemporal Filtering for Event-Based Action Recognition
Rohan Ghosh, Anupam Gupta, Andrei Nakagawa, Alcimar Soares, Nitish, Thakor

TL;DR
This paper introduces a novel spatiotemporal filtering method for event-based cameras, enhancing action recognition accuracy by preserving temporal precision and effectively processing sparse, noisy data with CNNs.
Contribution
It proposes an unsupervised learning approach for local spatiotemporal filters directly in the spike-event domain, improving action recognition performance over existing methods.
Findings
Significant performance improvement on DVS Gesture dataset
Effective handling of sparse, noisy event data
Outperforms standard spatiotemporal filtering methods
Abstract
In this paper, we address the challenging problem of action recognition, using event-based cameras. To recognise most gestural actions, often higher temporal precision is required for sampling visual information. Actions are defined by motion, and therefore, when using event-based cameras it is often unnecessary to re-sample the entire scene. Neuromorphic, event-based cameras have presented an alternative to visual information acquisition by asynchronously time-encoding pixel intensity changes, through temporally precise spikes (10 micro-second resolution), making them well equipped for action recognition. However, other challenges exist, which are intrinsic to event-based imagers, such as higher signal-to-noise ratio, and a spatiotemporally sparse information. One option is to convert event-data into frames, but this could result in significant temporal precision loss. In this work we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Hand Gesture Recognition Systems · Human Pose and Action Recognition
