Quality-Aware Decoding: Unifying Quality Estimation and Decoding

Sai Koneru; Matthias Huck; Miriam Exel; Jan Niehues

arXiv:2502.08561·cs.CL·June 3, 2025

Quality-Aware Decoding: Unifying Quality Estimation and Decoding

Sai Koneru, Matthias Huck, Miriam Exel, Jan Niehues

PDF

Open Access

TL;DR

This paper introduces a novel token-level quality estimation model integrated into the decoding process of neural machine translation, significantly improving translation quality especially in document translation tasks.

Contribution

It presents a new token-level QE model and a decoding strategy that directly incorporates quality estimation, unifying quality assessment with decoding for the first time.

Findings

01

Improved translation quality over N-best list re-ranking

02

Up to 1.39x improvement in quality metrics

03

Significant benefits in document translation tasks

Abstract

Quality Estimation (QE) models for Neural Machine Translation (NMT) predict the quality of the hypothesis without having access to the reference. An emerging research direction in NMT involves the use of QE models, which have demonstrated high correlations with human judgment and can enhance translations through Quality-Aware Decoding. Although several approaches have been proposed based on sampling multiple candidate translations and picking the best candidate, none have integrated these models directly into the decoding process. In this paper, we address this by proposing a novel token-level QE model capable of reliably scoring partial translations. We build a uni-directional QE model for this, as decoder models are inherently trained and efficient on partial sequences. We then present a decoding strategy that integrates the QE model for Quality-Aware decoding and demonstrate that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Advanced Database Systems and Queries · Bayesian Modeling and Causal Inference