Sign language recognition from skeletal data using graph and recurrent neural networks
B. Mederos, J. Mej\'ia, A. Medina-Reyes, Y. Espinosa-Almeyda, J. D. D\'iaz-Roman, I. Rodr\'iguez-Mederos, M. Mej\'ia-Carreon, and F. Gonzalez-Lopez

TL;DR
This paper introduces a novel graph and recurrent neural network approach for recognizing sign language gestures from skeletal pose data, demonstrating high accuracy on the AUTSL dataset and highlighting the effectiveness of pose-driven methods.
Contribution
It proposes a Graph-GRU temporal network that models spatial and temporal dependencies in skeleton data for sign language recognition, a novel integration for this task.
Findings
Achieved high accuracy on AUTSL dataset
Effective modeling of spatial and temporal dependencies
Scalable framework for sign language recognition
Abstract
This work presents an approach for recognizing isolated sign language gestures using skeleton-based pose data extracted from video sequences. A Graph-GRU temporal network is proposed to model both spatial and temporal dependencies between frames, enabling accurate classification. The model is trained and evaluated on the AUTSL (Ankara university Turkish sign language) dataset, achieving high accuracy. Experimental results demonstrate the effectiveness of integrating graph-based spatial representations with temporal modeling, providing a scalable framework for sign language recognition. The results of this approach highlight the potential of pose-driven methods for sign language understanding.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Human Motion and Animation
