Exploiting the ConvLSTM: Human Action Recognition using Raw Depth Video-Based Recurrent Neural Networks
Adrian Sanchez-Caballero, David Fuentes-Jimenez, Cristina, Losada-Guti\'errez

TL;DR
This paper introduces and compares ConvLSTM-based neural networks for human action recognition in raw depth videos, demonstrating that the stateful mode improves accuracy and efficiency over the stateless mode.
Contribution
The study proposes two ConvLSTM architectures for HAR, highlighting the benefits of the stateful mode in improving recognition accuracy and computational efficiency.
Findings
Stateful ConvLSTM improves recognition accuracy over stateless.
Proposed models achieve competitive accuracy with lower computational cost.
Stateful mode significantly enhances performance in video-based HAR.
Abstract
As in many other different fields, deep learning has become the main approach in most computer vision applications, such as scene understanding, object recognition, computer-human interaction or human action recognition (HAR). Research efforts within HAR have mainly focused on how to efficiently extract and process both spatial and temporal dependencies of video sequences. In this paper, we propose and compare, two neural networks based on the convolutional long short-term memory unit, namely ConvLSTM, with differences in the architecture and the long-term learning strategy. The former uses a video-length adaptive input data generator (\emph{stateless}) whereas the latter explores the \emph{stateful} ability of general recurrent neural networks but applied in the particular case of HAR. This stateful property allows the model to accumulate discriminative patterns from previous frames…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis
MethodsTanh Activation · Sigmoid Activation · Convolution · ConvLSTM
