Word-level Deep Sign Language Recognition from Video: A New Large-scale   Dataset and Methods Comparison

Dongxu Li; Cristian Rodriguez Opazo; Xin Yu; Hongdong Li

arXiv:1910.11006·cs.CV·January 22, 2020·53 cites

Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison

Dongxu Li, Cristian Rodriguez Opazo, Xin Yu, Hongdong Li

PDF

Open Access 3 Repos 2 Datasets

TL;DR

This paper introduces the largest publicly available large-scale sign language dataset for word-level recognition, compares deep learning models on it, and proposes a novel pose-based graph convolutional network to improve recognition accuracy.

Contribution

The paper provides a new extensive dataset for sign language recognition, benchmarks existing models, and introduces a novel pose-based graph convolutional network for improved performance.

Findings

01

Pose-based and appearance-based models achieve up to 66% top-10 accuracy.

02

The new dataset contains over 2000 words performed by 100 signers.

03

The Pose-TGCN model enhances pose-based recognition performance.

Abstract

Vision-based sign language recognition aims at helping deaf people to communicate with others. However, most existing sign language datasets are limited to a small number of words. Due to the limited vocabulary size, models learned from those datasets cannot be applied in practice. In this paper, we introduce a new large-scale Word-Level American Sign Language (WLASL) video dataset, containing more than 2000 words performed by over 100 signers. This dataset will be made publicly available to the research community. To our knowledge, it is by far the largest public ASL dataset to facilitate word-level sign recognition research. Based on this new large-scale dataset, we are able to experiment with several deep learning methods for word-level sign recognition and evaluate their performances in large scale scenarios. Specifically we implement and compare two different models,i.e., (i)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Gait Recognition and Analysis

Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Convolution