Towards AI-driven Sign Language Generation with Non-manual Markers
Han Zhang, Rotem Shalev-Arkushin, Vasileios Baltatzis, Connor Gillis,, Gierad Laput, Raja Kushalnagar, Lorna Quandt, Leah Findlater, Abdelkareem, Bedri, Colin Lea

TL;DR
This paper presents an AI system that translates English into American Sign Language videos, incorporating facial cues and body language for more natural communication, based on recent advances in language and video generation.
Contribution
It introduces a novel AI-driven sign language generation system that effectively captures non-manual markers and improves translation quality over existing methods.
Findings
User study with 30 DHH participants shows high satisfaction.
System achieves more natural and accurate sign language videos.
Identifies key areas for further improvement in sign language synthesis.
Abstract
Sign languages are essential for the Deaf and Hard-of-Hearing (DHH) community. Sign language generation systems have the potential to support communication by translating from written languages, such as English, into signed videos. However, current systems often fail to meet user needs due to poor translation of grammatical structures, the absence of facial cues and body language, and insufficient visual and motion fidelity. We address these challenges by building on recent advances in LLMs and video generation models to translate English sentences into natural-looking AI ASL signers. The text component of our model extracts information for manual and non-manual components of ASL, which are used to synthesize skeletal pose sequences and corresponding video frames. Our findings from a user study with 30 DHH participants and thorough technical evaluations demonstrate significant progress…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Speech and dialogue systems
