RMBR: A Regularized Minimum Bayes Risk Reranking Framework for Machine   Translation

Yidan Zhang; Yu Wan; Dayiheng Liu; Baosong Yang; Zhenan He

arXiv:2203.00201·cs.CL·March 2, 2022·1 cites

RMBR: A Regularized Minimum Bayes Risk Reranking Framework for Machine Translation

Yidan Zhang, Yu Wan, Dayiheng Liu, Baosong Yang, Zhenan He

PDF

Open Access

TL;DR

This paper introduces RMBR, a reranking framework for neural machine translation that improves translation quality by incorporating semantic similarity, translation quality, and model uncertainty into the MBR decoding process.

Contribution

The paper proposes a novel regularized MBR reranking framework that addresses limitations of traditional MBR by integrating semantic similarity, quality, and uncertainty measures.

Findings

01

RMBR outperforms standard beam search and traditional MBR in translation quality.

02

Incorporating semantic and uncertainty regularizers improves translation accuracy.

03

Extensive experiments validate the effectiveness of the proposed framework.

Abstract

Beam search is the most widely used decoding method for neural machine translation (NMT). In practice, the top-1 candidate with the highest log-probability among the n candidates is selected as the preferred one. However, this top-1 candidate may not be the best overall translation among the n-best list. Recently, Minimum Bayes Risk (MBR) decoding has been proposed to improve the quality for NMT, which seeks for a consensus translation that is closest on average to other candidates from the n-best list. We argue that MBR still suffers from the following problems: The utility function only considers the lexical-level similarity between candidates; The expected utility considers the entire n-best list which is time-consuming and inadequate candidates in the tail list may hurt the performance; Only the relationship between candidates is considered. To solve these issues, we design a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Machine Learning in Bioinformatics · Text and Document Classification Technologies