Pose-based Sign Language Recognition using GCN and BERT

Anirudh Tunga; Sai Vidyaranya Nuthalapati; Juan Wachs

arXiv:2012.00781·cs.CV·December 3, 2020

Pose-based Sign Language Recognition using GCN and BERT

Anirudh Tunga, Sai Vidyaranya Nuthalapati, Juan Wachs

PDF

TL;DR

This paper introduces a novel pose-based sign language recognition model that separately captures spatial and temporal information using GCN and BERT, achieving significant accuracy improvements on the WLASL dataset.

Contribution

The paper presents a new architecture that explicitly models spatial and temporal dependencies separately and fuses them, advancing pose-based WSLR methods.

Findings

01

Outperforms state-of-the-art pose-based methods by up to 5% accuracy

02

Uses GCN for spatial interactions and BERT for temporal dependencies

03

Achieves significant improvements on the WLASL dataset

Abstract

Sign language recognition (SLR) plays a crucial role in bridging the communication gap between the hearing and vocally impaired community and the rest of the society. Word-level sign language recognition (WSLR) is the first important step towards understanding and interpreting sign language. However, recognizing signs from videos is a challenging task as the meaning of a word depends on a combination of subtle body motions, hand configurations, and other movements. Recent pose-based architectures for WSLR either model both the spatial and temporal dependencies among the poses in different frames simultaneously or only model the temporal information without fully utilizing the spatial information. We tackle the problem of WSLR using a novel pose-based approach, which captures spatial and temporal information separately and performs late fusion. Our proposed architecture explicitly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.