PreQuEL: Quality Estimation of Machine Translation Outputs in Advance
Shachar Don-Yehiya, Leshem Choshen, Omri Abend

TL;DR
PreQuEL introduces a novel approach to predict translation quality based solely on source text properties, reducing resource use and enhancing quality estimation by leveraging data augmentation and linguistic features.
Contribution
The paper presents the first PreQuEL task, a baseline model, and a data augmentation method that significantly improves translation quality prediction without requiring actual translations.
Findings
Data augmentation enhances prediction accuracy.
Model captures syntactic and semantic features.
Augmentation benefits general quality estimation.
Abstract
We present the task of PreQuEL, Pre-(Quality-Estimation) Learning. A PreQuEL system predicts how well a given sentence will be translated, without recourse to the actual translation, thus eschewing unnecessary resource allocation when translation quality is bound to be low. PreQuEL can be defined relative to a given MT system (e.g., some industry service) or generally relative to the state-of-the-art. From a theoretical perspective, PreQuEL places the focus on the source text, tracing properties, possibly linguistic features, that make a sentence harder to machine translate. We develop a baseline model for the task and analyze its performance. We also develop a data augmentation method (from parallel corpora), that improves results substantially. We show that this augmentation method can improve the performance of the Quality-Estimation task as well. We investigate the properties of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsAttentive Walk-Aggregating Graph Neural Network
