Transferring Cross-domain Knowledge for Video Sign Language Recognition

Dongxu Li; Xin Yu; Chenchen Xu; Lars Petersson; Hongdong Li

arXiv:2003.03703 · cs.CV · March 18, 2020 · 19 cites

TL;DR

This paper introduces a novel approach for video sign language recognition that leverages subtitled news videos by transferring domain-invariant visual concepts, significantly improving recognition accuracy despite domain gaps.

Contribution

The paper proposes a method to transfer knowledge from subtitled sign news videos to improve word-level sign language recognition models, addressing data annotation challenges.

Findings

1. Outperforms previous state-of-the-art methods on standard datasets.
2. Achieves 28.1 [email protected] in sign localization.
3. Effectively transfers knowledge across domain gaps.

Abstract

Word-level sign language recognition (WSLR) is a fundamental task in sign language interpretation. It requires models to recognize isolated sign words from videos. However, annotating WSLR data needs expert knowledge, thus limiting WSLR dataset acquisition. On the contrary, there are abundant subtitled sign news videos on the internet. Since these videos have no word-level annotation and exhibit a large domain gap from isolated signs, they cannot be directly used for training WSLR models. We observe that despite the existence of a large domain gap, isolated and news signs share the same visual concepts, such as hand gestures and body movements. Motivated by this observation, we propose a novel method that learns domain-invariant visual concepts and fertilizes WSLR models by transferring knowledge of subtitled news sign to them. To this end, we extract news signs using a base WSLR model,…

Taxonomy

Topics: Hand Gesture Recognition Systems · Human Pose and Action Recognition · Hearing Impairment and Communication