SyriSign: A Parallel Corpus for Arabic Text to Syrian Arabic Sign Language Translation
Mohammad Amer Khalil, Raghad Nahas, Ahmad Nassar, Khloud Al Jallad

TL;DR
This paper introduces SyriSign, a new dataset of 1500 videos for Syrian Arabic Sign Language translation, aiming to support research in low-resource sign language processing.
Contribution
It provides the first publicly available dataset for Syrian Arabic Sign Language translation and evaluates deep learning models on this resource.
Findings
Deep learning models show potential but are limited by dataset size.
SyriSign can serve as an initial benchmark for future research.
Generative approaches can effectively represent sign language but need more data.
Abstract
Sign language is the primary approach of communication for the Deaf and Hard-of-Hearing (DHH) community. While there are numerous benchmarks for high-resource sign languages, low-resource languages like Arabic remain underrepresented. Currently, there is no publicly available dataset for Syrian Arabic Sign Language (SyArSL). To overcome this gap, we introduce SyriSign, a dataset comprising 1500 video samples across 150 unique lexical signs, designed for text-to-SyArSL translation tasks. This work aims to reduce communication barriers in Syria, as most news are delivered in spoken or written Arabic, which is often inaccessible to the deaf community. We evaluated SyriSign using three deep learning architectures: MotionCLIP for semantic motion generation, T2M-GPT for text-conditioned motion synthesis, and SignCLIP for bilingual embedding alignment. Experimental results indicate that while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
