Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks
Hongsong Wang, Liang Wang

TL;DR
This paper introduces a two-stream RNN architecture that models both temporal dynamics and spatial configurations of skeletons for action recognition, improving accuracy over traditional methods.
Contribution
The paper proposes a novel two-stream RNN framework with hierarchical and stacked structures, incorporating spatial sequence modeling and 3D data augmentation for enhanced skeleton-based action recognition.
Findings
Significant improvement on 3D action recognition benchmarks.
Effective modeling of spatial configurations improves recognition accuracy.
Data augmentation enhances model generalization.
Abstract
Recently, skeleton based action recognition gains more popularity due to cost-effective depth sensors coupled with real-time skeleton estimation algorithms. Traditional approaches based on handcrafted features are limited to represent the complexity of motion patterns. Recent methods that use Recurrent Neural Networks (RNN) to handle raw skeletons only focus on the contextual dependency in the temporal domain and neglect the spatial configurations of articulated skeletons. In this paper, we propose a novel two-stream RNN architecture to model both temporal dynamics and spatial configurations for skeleton based action recognition. We explore two different structures for the temporal stream: stacked RNN and hierarchical RNN. Hierarchical RNN is designed according to human body kinematics. We also propose two effective methods to model the spatial structure by converting the spatial graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Gait Recognition and Analysis
