Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison
Dongxu Li, Cristian Rodriguez Opazo, Xin Yu, Hongdong Li

TL;DR
This paper introduces WLASL, the largest public American Sign Language dataset for word-level recognition to date, compares deep learning models on it, and proposes a novel pose-based graph convolutional network to improve recognition accuracy.
Contribution
The paper provides a new extensive dataset for sign language recognition, benchmarks existing models, and introduces a novel pose-based graph convolutional network for improved performance.
Findings
Pose-based and appearance-based models achieve up to 66% top-10 accuracy.
The new dataset contains more than 2000 words performed by over 100 signers.
The Pose-TGCN model enhances pose-based recognition performance.
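The top-10 accuracy cited in the findings counts a prediction as correct when the true word is among the model's ten highest-scoring classes. A minimal sketch of that metric, assuming per-video class scores are available as plain lists; the function name `top_k_accuracy` and the toy scores are illustrative and do not come from the paper:

```python
from typing import Sequence


def top_k_accuracy(scores: Sequence[Sequence[float]],
                   labels: Sequence[int],
                   k: int = 10) -> float:
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("at least one sample is required")

    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by descending score, truncated to the top k.
        top_k = sorted(range(len(row)), key=lambda c: row[c], reverse=True)[:k]
        if label in top_k:
            hits += 1
    return hits / len(labels)


# Toy example: 3 videos, 3 candidate words (hypothetical scores).
toy_scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.3, 0.5],
]
toy_labels = [1, 1, 0]

top1 = top_k_accuracy(toy_scores, toy_labels, k=1)
top2 = top_k_accuracy(toy_scores, toy_labels, k=2)
```

With a vocabulary of about 2000 words, a top-10 criterion is considerably more lenient than top-1, so the two numbers should not be compared directly.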
Abstract
Vision-based sign language recognition aims at helping deaf people to communicate with others. However, most existing sign language datasets are limited to a small number of words. Due to the limited vocabulary size, models learned from those datasets cannot be applied in practice. In this paper, we introduce a new large-scale Word-Level American Sign Language (WLASL) video dataset, containing more than 2000 words performed by over 100 signers. This dataset will be made publicly available to the research community. To our knowledge, it is by far the largest public ASL dataset to facilitate word-level sign recognition research. Based on this new large-scale dataset, we are able to experiment with several deep learning methods for word-level sign recognition and evaluate their performance in large-scale scenarios. Specifically, we implement and compare two different models, i.e., (i)…
Taxonomy
Topics: Hand Gesture Recognition Systems · Human Pose and Action Recognition · Gait Recognition and Analysis
Methods: Convolution
