Learning from What is Already Out There: Few-shot Sign Language   Recognition with Online Dictionaries

Maty\'a\v{s} Boh\'a\v{c}ek; Marek Hr\'uz

arXiv:2301.03769·cs.CV·January 11, 2023

Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries

Maty\'a\v{s} Boh\'a\v{c}ek, Marek Hr\'uz

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new few-shot sign language recognition approach leveraging online dictionaries, along with a novel dataset, achieving state-of-the-art results and promoting accessible sign language technology.

Contribution

The work presents the UWB-SL-Wild dataset and a novel few-shot training method for sign language recognition, enabling better generalization with limited data.

Findings

01

Achieved top-1 accuracy of 30.97% on ASLLVD-Skeleton.

02

Achieved top-1 accuracy of 95.45% on ASLLVD-Skeleton-20.

03

Introduced the first dataset of its kind from online dictionaries.

Abstract

Today's sign language recognition models require large training corpora of laboratory-like videos, whose collection involves an extensive workforce and financial resources. As a result, only a handful of such systems are publicly available, not to mention their limited localization capabilities for less-populated sign languages. Utilizing online text-to-video dictionaries, which inherently hold annotated data of various attributes and sign languages, and training models in a few-shot fashion hence poses a promising path for the democratization of this technology. In this work, we collect and open-source the UWB-SL-Wild few-shot dataset, the first of its kind training resource consisting of dictionary-scraped videos. This dataset represents the actual distribution and characteristics of available online sign language data. We select glosses that directly overlap with the already existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

matyasbohacek/uwb-sl-wild
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Hearing Impairment and Communication