Targeted Linguistic Analysis of Sign Language Models with Minimal Translation Pairs
Serpil Karab\"ukl\"u, Kanishka Misra, Shester Gueuwou, Diane Brentari, Greg Shakhnarovich, Karen Livescu

TL;DR
This paper introduces ASL-MTP, a new benchmark dataset for analyzing sign language models' understanding of linguistic phenomena and cue utilization, revealing reliance on manual cues over non-manual cues.
Contribution
It provides a novel dataset for detailed linguistic analysis of sign language models and demonstrates a targeted evaluation approach using ablation studies.
Findings
Models perform above chance on most phenomena.
Models rely heavily on manual cues.
Models often miss crucial non-manual cues.
Abstract
Models of sign language have historically lagged behind those for spoken language (text and speech). Recent work has greatly improved their performance on tasks like sign language translation and isolated sign recognition. However, it remains unclear to what extent existing models capture various linguistic phenomena of sign language, and how well they use cues from the multiple articulators used in sign language (hands, upper body, face). We introduce a new benchmark dataset for American Sign Language, ASL Minimal Translation Pairs (ASL-MTP), divided into multiple types of sign language phenomena and corresponding minimal pairs of translations, for performing such linguistic analyses. As a case study, we use ASL-MTP to analyze a state-of-the-art ASL-to-English translation model. We conduct a targeted analysis of the model by ablating various input cues during training and inference and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
