TransQuest at WMT2020: Sentence-Level Direct Assessment

Tharindu Ranasinghe; Constantin Orasan; Ruslan Mitkov

arXiv:2010.05318·cs.CL·October 13, 2020

TL;DR

This paper introduces a transformer-based quality estimation (QE) framework for sentence-level translation assessment. It achieves state-of-the-art results and is the winning solution in every language pair of the WMT 2020 Sentence-Level Direct Assessment shared task.

Contribution

The paper presents a simple yet effective QE framework based on cross-lingual transformers, further tuned with ensembling and data augmentation, that outperforms OpenKiwi, the baseline used in the shared task.

Findings

01 Achieved state-of-the-art results in the WMT 2020 shared task

02 Outperformed the OpenKiwi baseline in all language pairs

03 Was the winning solution in all evaluated language pairs

Abstract

This paper presents team TransQuest's participation in the Sentence-Level Direct Assessment shared task at WMT 2020. We introduce a simple quality estimation (QE) framework based on cross-lingual transformers, and we use it to implement and evaluate two different neural architectures. The proposed methods achieve state-of-the-art results, surpassing those obtained by OpenKiwi, the baseline used in the shared task. We further fine-tune the QE framework with ensembling and data augmentation. Our approach is the winning solution in all of the language pairs according to the official WMT 2020 results.
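The ensembling step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the two architectures are combined, so the weighted average below is an assumption, as is scoring the result with Pearson correlation against human direct-assessment scores. The function names `weighted_ensemble` and `pearson`, and all numbers in the usage example, are hypothetical and are not taken from the paper or its repository.

```python
import math


def weighted_ensemble(preds_a, preds_b, weight_a=0.5):
    """Combine two models' sentence-level scores with a weighted average.

    Assumption: a plain weighted average; the paper's actual scheme may differ.
    """
    if len(preds_a) != len(preds_b):
        raise ValueError("prediction lists must have the same length")
    return [weight_a * a + (1.0 - weight_a) * b for a, b in zip(preds_a, preds_b)]


def pearson(xs, ys):
    """Pearson correlation between predicted and reference scores."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least two items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Illustrative, made-up scores for four translated sentences.
    model_a = [0.10, -0.40, 0.80, 0.30]
    model_b = [0.20, -0.10, 0.60, 0.50]
    human = [0.15, -0.30, 0.90, 0.35]

    combined = weighted_ensemble(model_a, model_b, weight_a=0.5)
    print("ensemble scores:", combined)
    print("Pearson r vs. human scores:", pearson(combined, human))
```

In a setup like this, `weight_a` would typically be chosen on a development set, picking the value that gives the highest correlation with the human scores.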

Code & Models

Repositories

tharindudr/transQuest (PyTorch, official)
