MDQE: A More Accurate Direct Pretraining for Machine Translation Quality Estimation
Lei Lin

TL;DR
This paper introduces MDQE, a novel pretraining framework for Machine Translation Quality Estimation that generates pseudo data and pretrains estimators directly on it, leading to improved performance without relying on large pre-trained models.
Contribution
It proposes a new direct pretraining framework with a generator and estimator for QE, reducing reliance on external pretraining models like BERT.
Findings
Outperforms existing QE methods on benchmark datasets.
Does not require pretraining models such as BERT.
Achieves higher accuracy in quality estimation tasks.
Abstract
It is expensive to evaluate the results of Machine Translation(MT), which usually requires manual translation as a reference. Machine Translation Quality Estimation (QE) is a task of predicting the quality of machine translations without relying on any reference. Recently, the emergence of predictor-estimator framework which trains the predictor as a feature extractor and estimator as a QE predictor, and pre-trained language models(PLM) have achieved promising QE performance. However, we argue that there are still gaps between the predictor and the estimator in both data quality and training objectives, which preclude QE models from benefiting from a large number of parallel corpora more directly. Based on previous related work that have alleviated gaps to some extent, we propose a novel framework that provides a more accurate direct pretraining for QE tasks. In this framework, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Linear Decay · Residual Connection · Softmax · Weight Decay
