SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision

Yasser Hamidullah; Shakib Yazdani; Cennet Oguz; Josef van Genabith; Cristina Espa\~na-Bonet

arXiv:2510.19398·cs.CL·October 23, 2025

SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision

Yasser Hamidullah, Shakib Yazdani, Cennet Oguz, Josef van Genabith, Cristina Espa\~na-Bonet

PDF

Open Access 1 Video

TL;DR

This paper introduces SONAR-SLT, a multilingual sign language translation method using language-agnostic sentence embeddings and data augmentation, improving cross-language generalization and robustness especially in low-resource scenarios.

Contribution

It proposes a novel multilingual SLT approach with language-agnostic embeddings and coupled augmentation, enhancing scalability and performance over previous methods.

Findings

01

Consistent BLEURT improvements over text-only supervision.

02

Significant gains in low-resource language settings.

03

Enhanced model robustness through data augmentation.

Abstract

Sign language translation (SLT) is typically trained with text in a single spoken language, which limits scalability and cross-language generalization. Earlier approaches have replaced gloss supervision with text-based sentence embeddings, but up to now, these remain tied to a specific language and modality. In contrast, here we employ language-agnostic, multimodal embeddings trained on text and speech from multiple languages to supervise SLT, enabling direct multilingual translation. To address data scarcity, we propose a coupled augmentation method that combines multilingual target augmentations (i.e. translations into many languages) with video-level perturbations, improving model robustness. Experiments show consistent BLEURT gains over text-only sentence embedding supervision, with larger improvements in low-resource settings. Our results demonstrate that language-agnostic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

SONAR-SLT: Multilingual Sign Language Translation via Language-Agnostic Sentence Embedding Supervision· underline

Taxonomy

TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Natural Language Processing Techniques