Towards the extraction of robust sign embeddings for low resource sign language recognition
Mathieu De Coster, Ellen Rushe, Ruth Holmes, Anthony Ventresque, Joni, Dambre

TL;DR
This paper compares pose estimators for sign language recognition, introduces normalization and embedding techniques to improve robustness, and demonstrates cross-lingual transferability, aiding low-resource sign languages.
Contribution
It evaluates popular pose estimators for SLR, proposes methods to enhance keypoint-based embeddings, and shows their effectiveness in cross-lingual transfer and low-resource scenarios.
Findings
Keypoint normalization and embedding improve SLR accuracy.
Transfer learning with embeddings outperforms training from scratch.
Embeddings transfer across sign languages, aiding low-resource SLR.
Abstract
Isolated Sign Language Recognition (SLR) has mostly been applied on datasets containing signs executed slowly and clearly by a limited group of signers. In real-world scenarios, however, we are met with challenging visual conditions, coarticulated signing, small datasets, and the need for signer independent models. To tackle this difficult problem, we require a robust feature extractor to process the sign language videos. One could expect human pose estimators to be ideal candidates. However, due to a domain mismatch with their training sets and challenging poses in sign language, they lack robustness on sign language data and image-based models often still outperform keypoint-based models. Furthermore, whereas the common practice of transfer learning with image-based models yields even higher accuracy, keypoint-based models are typically trained from scratch on every SLR dataset. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Human Pose and Action Recognition
MethodsOpenPose · Surrogate Lagrangian Relaxation
