QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation
Gon\c{c}alo R. A. Faria, Sweta Agrawal, Ant\'onio Farinhas, Ricardo, Rei, Jos\'e G. C. de Souza, Andr\'e F. T. Martins

TL;DR
This paper introduces QUEST, a sampling method using a quality-aware Metropolis-Hastings algorithm to generate diverse, high-quality translations in machine translation, effectively leveraging quality metrics as energy functions.
Contribution
It proposes a novel sampling approach that combines quality metrics with MCMC to improve translation diversity and quality, addressing limitations of likelihood-based methods.
Findings
Achieves higher translation quality across multiple language pairs.
Produces more diverse translations compared to traditional methods.
Effective with decoder-only large language models.
Abstract
An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware and minimum Bayes risk decoding). However, relying on a single translation with high estimated quality increases the chances of "gaming the metric''. In this paper, we address the problem of sampling a set of high-quality and diverse translations. We provide a simple and effective way to avoid over-reliance on noisy quality estimates by using them as the energy function of a Gibbs distribution. Instead of looking for a mode in the distribution, we generate multiple samples from high-density areas…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsSparse Evolutionary Training
