Beyond a Single Reference: Training and Evaluation with Paraphrases in Sign Language Translation

V\'aclav Javorek; Tom\'a\v{s} \v{Z}elezn\'y; Alessa Carbo; Marek Hr\'uz; Ivan Gruber

arXiv:2601.21128·cs.AI·January 30, 2026

Beyond a Single Reference: Training and Evaluation with Paraphrases in Sign Language Translation

V\'aclav Javorek, Tom\'a\v{s} \v{Z}elezn\'y, Alessa Carbo, Marek Hr\'uz, Ivan Gruber

PDF

Open Access

TL;DR

This paper explores using large language models to generate paraphrased references for sign language translation, improving evaluation metrics and aligning better with human judgment, while also examining the effects on training.

Contribution

It introduces BLEUpara, a new evaluation metric using multiple paraphrased references, and analyzes the impact of paraphrases on SLT training and evaluation.

Findings

01

Paraphrases improve evaluation scores when used during testing.

02

Naive incorporation of paraphrases in training does not enhance performance.

03

BLEUpara correlates more strongly with human judgments.

Abstract

Most Sign Language Translation (SLT) corpora pair each signed utterance with a single written-language reference, despite the highly non-isomorphic relationship between sign and spoken languages, where multiple translations can be equally valid. This limitation constrains both model training and evaluation, particularly for n-gram-based metrics such as BLEU. In this work, we investigate the use of Large Language Models to automatically generate paraphrased variants of written-language translations as synthetic alternative references for SLT. First, we compare multiple paraphrasing strategies and models using an adapted ParaScore metric. Second, we study the impact of paraphrases on both training and evaluation of the pose-based T5 model on the YouTubeASL and How2Sign datasets. Our results show that naively incorporating paraphrases during training does not improve translation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Natural Language Processing Techniques