MDQE: A More Accurate Direct Pretraining for Machine Translation Quality Estimation

Lei Lin

arXiv:2107.14600·cs.CL·April 19, 2022

TL;DR

This paper introduces MDQE, a novel pretraining framework for Machine Translation Quality Estimation that generates pseudo data and pretrains estimators directly on it, leading to improved performance without relying on large pre-trained models.

Contribution

It proposes a new direct pretraining framework for QE, consisting of a generator and an estimator, that reduces reliance on external pretrained models such as BERT.

Findings

01

Outperforms existing QE methods on benchmark datasets.

02

Does not require pretrained models such as BERT.

03

Achieves higher accuracy in quality estimation tasks.

Abstract

Evaluating the output of Machine Translation (MT) is expensive, since it usually requires a manual translation as a reference. Machine Translation Quality Estimation (QE) is the task of predicting the quality of machine translations without relying on any reference. Recently, the predictor-estimator framework, which trains the predictor as a feature extractor and the estimator as a QE predictor, and pre-trained language models (PLMs) have achieved promising QE performance. However, we argue that there are still gaps between the predictor and the estimator in both data quality and training objectives, which preclude QE models from benefiting more directly from large parallel corpora. Building on previous related work that has alleviated these gaps to some extent, we propose a novel framework that provides a more accurate direct pretraining for QE tasks. In this framework, a…

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Dense Connections · Linear Warmup With Linear Decay · Residual Connection · Softmax · Weight Decay