Deep-Temporal LSTM for Daily Living Action Recognition
Srijan Das, Michal Koperski, Francois Bremond, Gianpiero Francesca

TL;DR
This paper introduces a deep-temporal LSTM architecture that enhances daily living action recognition by combining long-term skeleton evolution with static appearance, outperforming many existing methods.
Contribution
It proposes a novel deep-temporal LSTM model that better encodes temporal information and fuses 3D skeleton geometry with static appearance for improved action recognition.
Findings
Achieved competitive results on CAD60, MSRDailyActivity3D, and NTU-RGB+D datasets.
Demonstrated the importance of modeling long-term skeleton evolution.
Validated the effectiveness of combining skeleton geometry with static appearance.
Abstract
In this paper, we propose to improve the traditional use of RNNs by employing a many to many model for video classification. We analyze the importance of modeling spatial layout and temporal encoding for daily living action recognition. Many RGB methods focus only on short term temporal information obtained from optical flow. Skeleton based methods on the other hand show that modeling long term skeleton evolution improves action recognition accuracy. In this work, we propose a deep-temporal LSTM architecture which extends standard LSTM and allows better encoding of temporal information. In addition, we propose to fuse 3D skeleton geometry with deep static appearance. We validate our approach on public available CAD60, MSRDailyActivity3D and NTU-RGB+D, achieving competitive performance as compared to the state-of-the art.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Context-Aware Activity Recognition Systems · Gait Recognition and Analysis
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
