An Exploratory Analysis of Multilingual Word-Level Quality Estimation   with Cross-Lingual Transformers

Tharindu Ranasinghe; Constantin Orasan; Ruslan Mitkov

arXiv:2106.00143·cs.CL·June 2, 2021

An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers

Tharindu Ranasinghe, Constantin Orasan, Ruslan Mitkov

PDF

1 Repo

TL;DR

This paper investigates multilingual word-level Quality Estimation using cross-lingual transformers, demonstrating that such models can perform comparably to language-specific models and generalize well across different language pairs, including zero-shot and few-shot scenarios.

Contribution

It introduces multilingual QE models based on pre-trained transformers that eliminate the need for language-specific training data, enabling effective cross-lingual predictions.

Findings

01

Multilingual QE models perform on par with language-specific models.

02

Zero-shot and few-shot QE are feasible with the proposed models.

03

Transformer-based models generalize well across languages.

Abstract

Most studies on word-level Quality Estimation (QE) of machine translation focus on language-specific models. The obvious disadvantages of these approaches are the need for labelled data for each language pair and the high cost required to maintain several language-specific models. To overcome these problems, we explore different approaches to multilingual, word-level QE. We show that these QE models perform on par with the current language-specific models. In the cases of zero-shot and few-shot QE, we demonstrate that it is possible to accurately predict word-level quality for any given new language pair from models trained on other language pairs. Our findings suggest that the word-level QE models based on powerful pre-trained transformers that we propose in this paper generalise well across languages, making them more useful in real-world scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tharindudr/transQuest
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.