Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries
Maty\'a\v{s} Boh\'a\v{c}ek, Marek Hr\'uz

TL;DR
This paper introduces a new few-shot sign language recognition approach leveraging online dictionaries, along with a novel dataset, achieving state-of-the-art results and promoting accessible sign language technology.
Contribution
The work presents the UWB-SL-Wild dataset and a novel few-shot training method for sign language recognition, enabling better generalization with limited data.
Findings
Achieved top-1 accuracy of 30.97% on ASLLVD-Skeleton.
Achieved top-1 accuracy of 95.45% on ASLLVD-Skeleton-20.
Introduced the first dataset of its kind from online dictionaries.
Abstract
Today's sign language recognition models require large training corpora of laboratory-like videos, whose collection involves an extensive workforce and financial resources. As a result, only a handful of such systems are publicly available, not to mention their limited localization capabilities for less-populated sign languages. Utilizing online text-to-video dictionaries, which inherently hold annotated data of various attributes and sign languages, and training models in a few-shot fashion hence poses a promising path for the democratization of this technology. In this work, we collect and open-source the UWB-SL-Wild few-shot dataset, the first of its kind training resource consisting of dictionary-scraped videos. This dataset represents the actual distribution and characteristics of available online sign language data. We select glosses that directly overlap with the already existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Hearing Impairment and Communication
