YouTube-SL-25: A Large-Scale, Open-Domain Multilingual Sign Language Parallel Corpus
Garrett Tanzer, Biao Zhang

TL;DR
This paper introduces YouTube-SL-25, a large-scale, multilingual sign language video corpus from YouTube, enabling improved machine learning research across numerous sign languages, especially benefiting low-resource languages.
Contribution
The creation of YouTube-SL-25, the largest parallel sign language dataset to date, with baseline models demonstrating benefits of multilingual transfer learning.
Findings
YouTube-SL-25 contains over 3000 hours of videos across 25 sign languages.
Multilingual transfer improves sign-to-text performance for both high- and low-resource languages.
Baseline models achieve promising results across multiple sign languages.
Abstract
Even for better-studied sign languages like American Sign Language (ASL), data is the bottleneck for machine learning research. The situation is worse yet for the many other sign languages used by Deaf/Hard of Hearing communities around the world. In this paper, we present YouTube-SL-25, a large-scale, open-domain multilingual corpus of sign language videos with seemingly well-aligned captions drawn from YouTube. With >3000 hours of videos across >25 sign languages, YouTube-SL-25 is a) >3x the size of YouTube-ASL, b) the largest parallel sign language dataset to date, and c) the first or largest parallel dataset for many of its component languages. We provide baselines for sign-to-text tasks using a unified multilingual multitask model based on T5 and report scores on benchmarks across 4 sign languages. The results demonstrate that multilingual transfer benefits both higher- and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Subtitles and Audiovisual Media
MethodsGated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · 7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Byte Pair Encoding · Linear Layer · SentencePiece · Softmax · Multi-Head Attention · Dense Connections
