Transcribing Natural Languages for The Deaf via Neural Editing Programs
Dongxu Li, Chenchen Xu, Liu Liu, Yiran Zhong, Rong Wang, Lars, Petersson, Hongdong Li

TL;DR
This paper introduces a neural editing program approach for transcribing spoken language into sign language glosses, improving accuracy over previous sequence-to-sequence models by learning to execute editing actions.
Contribution
It proposes a novel neural agent that learns to synthesize and execute editing programs for glossification, capturing complex language connections more effectively.
Findings
Outperforms previous models significantly
Uses editing actions like addition, deletion, copying
Achieves higher transcription quality
Abstract
This work studies the task of glossification, of which the aim is to em transcribe natural spoken language sentences for the Deaf (hard-of-hearing) community to ordered sign language glosses. Previous sequence-to-sequence language models trained with paired sentence-gloss data often fail to capture the rich connections between the two distinct languages, leading to unsatisfactory transcriptions. We observe that despite different grammars, glosses effectively simplify sentences for the ease of deaf communication, while sharing a large portion of vocabulary with sentences. This has motivated us to implement glossification by executing a collection of editing actions, e.g. word addition, deletion, and copying, called editing programs, on their natural spoken language counterparts. Specifically, we design a new neural agent that learns to synthesize and execute editing programs, conditioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Tactile and Sensory Interactions
