Quality-Aware Decoding: Unifying Quality Estimation and Decoding
Sai Koneru, Matthias Huck, Miriam Exel, Jan Niehues

TL;DR
This paper introduces a novel token-level quality estimation model integrated into the decoding process of neural machine translation, significantly improving translation quality especially in document translation tasks.
Contribution
It presents a new token-level QE model and a decoding strategy that directly incorporates quality estimation, unifying quality assessment with decoding for the first time.
Findings
Improved translation quality over N-best list re-ranking
Up to 1.39x improvement in quality metrics
Significant benefits in document translation tasks
Abstract
Quality Estimation (QE) models for Neural Machine Translation (NMT) predict the quality of the hypothesis without having access to the reference. An emerging research direction in NMT involves the use of QE models, which have demonstrated high correlations with human judgment and can enhance translations through Quality-Aware Decoding. Although several approaches have been proposed based on sampling multiple candidate translations and picking the best candidate, none have integrated these models directly into the decoding process. In this paper, we address this by proposing a novel token-level QE model capable of reliably scoring partial translations. We build a uni-directional QE model for this, as decoder models are inherently trained and efficient on partial sequences. We then present a decoding strategy that integrates the QE model for Quality-Aware decoding and demonstrate that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Advanced Database Systems and Queries · Bayesian Modeling and Causal Inference
