Sign Language Sense Disambiguation
Jana Grimm, Miriam Winkler, Oliver Kraus, Tanalp Agustoslu

TL;DR
This paper investigates how transformer models focusing on different body parts, like hands or mouth, can improve sign language disambiguation, enhancing translation accuracy for German sign language especially in varying dataset sizes.
Contribution
It introduces a body-part focused transformer approach for sign language disambiguation, demonstrating how different body parts influence performance in small versus large datasets.
Findings
Mouth-focused models perform better with small datasets.
Hand-focused models excel with larger datasets.
Results improve sign language translation accuracy.
Abstract
This project explores methods to enhance sign language translation of German sign language, specifically focusing on disambiguation of homonyms. Sign language is ambiguous and understudied which is the basis for our experiments. We approach the improvement by training transformer-based models on various bodypart representations to shift the focus on said bodypart. To determine the impact of, e.g., the hand or mouth representations, we experiment with different combinations. The results show that focusing on the mouth increases the performance in small dataset settings while shifting the focus on the hands retrieves better results in larger dataset settings. Our results contribute to better accessibility for non-hearing persons by improving the systems powering digital assistants, enabling a more accurate interaction. The code for this project can be found on GitHub.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Speech and dialogue systems
MethodsFocus
