Hindsight Quality Prediction Experiments in Multi-Candidate Human-Post-Edited Machine Translation

Malik Marmonier; Beno\^it Sagot; Rachel Bawden

arXiv:2603.04083·cs.CL·March 5, 2026

Hindsight Quality Prediction Experiments in Multi-Candidate Human-Post-Edited Machine Translation

Malik Marmonier, Beno\^it Sagot, Rachel Bawden

PDF

Open Access

TL;DR

This study evaluates how the rise of Large Language Models affects existing methods for predicting machine translation quality, using a multi-candidate dataset and correlation metrics to compare traditional and new paradigms.

Contribution

It provides insights into the changing effectiveness of quality prediction paradigms in the context of LLM integration into MT workflows.

Findings

01

LLMs impact the reliability of traditional quality prediction methods

02

Document-level translation challenges are reduced with LLMs

03

Candidate-side QE remains valuable for quality assessment

Abstract

This paper investigates two complementary paradigms for predicting machine translation (MT) quality: source-side difficulty prediction and candidate-side quality estimation (QE). The rapid adoption of Large Language Models (LLMs) into MT workflows is reshaping the research landscape, yet its impact on established quality prediction paradigms remains underexplored. We study this issue through a series of "hindsight" experiments on a unique, multi-candidate dataset resulting from a genuine MT post-editing (MTPE) project. The dataset consists of over 6,000 English source segments with nine translation hypotheses from a diverse set of traditional neural MT systems and advanced LLMs, all evaluated against a single, final human post-edited reference. Using Kendall's rank correlation, we assess the predictive power of source-side difficulty metrics, candidate-side QE models and position…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Translation Studies and Practices · Topic Modeling