Transferring Cross-domain Knowledge for Video Sign Language Recognition
Dongxu Li, Xin Yu, Chenchen Xu, Lars Petersson, Hongdong Li

TL;DR
This paper introduces a novel approach for video sign language recognition that leverages subtitled news videos by transferring domain-invariant visual concepts, significantly improving recognition accuracy despite domain gaps.
Contribution
The paper proposes a method to transfer knowledge from subtitled sign news videos to improve word-level sign language recognition models, addressing data annotation challenges.
Findings
Outperforms previous state-of-the-art methods on standard datasets.
Achieves 28.1 [email protected] in sign localization.
Effectively transfers knowledge across domain gaps.
Abstract
Word-level sign language recognition (WSLR) is a fundamental task in sign language interpretation. It requires models to recognize isolated sign words from videos. However, annotating WSLR data needs expert knowledge, thus limiting WSLR dataset acquisition. On the contrary, there are abundant subtitled sign news videos on the internet. Since these videos have no word-level annotation and exhibit a large domain gap from isolated signs, they cannot be directly used for training WSLR models. We observe that despite the existence of a large domain gap, isolated and news signs share the same visual concepts, such as hand gestures and body movements. Motivated by this observation, we propose a novel method that learns domain-invariant visual concepts and fertilizes WSLR models by transferring knowledge of subtitled news sign to them. To this end, we extract news signs using a base WSLR model,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Transferring Cross-Domain Knowledge for Video Sign Language Recognition· youtube
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Hearing Impairment and Communication
