Analyzing Uncertainty in Neural Machine Translation

Myle Ott; Michael Auli; David Grangier; Marc'Aurelio Ranzato

arXiv:1803.00047·cs.CL·August 14, 2018·55 cites

Analyzing Uncertainty in Neural Machine Translation

Myle Ott, Michael Auli, David Grangier, Marc'Aurelio Ranzato

PDF

Open Access 1 Repo

TL;DR

This paper investigates the inherent and extrinsic uncertainties in neural machine translation, proposing new tools and metrics to evaluate and improve model calibration and translation diversity.

Contribution

It introduces novel methods for assessing uncertainty in NMT models and provides practical tools to enhance translation quality and diversity.

Findings

01

Search strategies are effective despite models spreading probability mass too broadly.

02

Models tend to underestimate rare words and lack diversity in translations.

03

Tools for assessing and fixing model calibration are proposed.

Abstract

Machine translation is a popular test bed for research in neural sequence-to-sequence models but despite much recent research, there is still a lack of understanding of these models. Practitioners report performance degradation with large beams, the under-estimation of rare words and a lack of diversity in the final translations. Our study relates some of these issues to the inherent uncertainty of the task, due to the existence of multiple valid translations for a single source sentence, and to the extrinsic uncertainty caused by noisy training data. We propose tools and metrics to assess how uncertainty in the data is captured by the model distribution and how it affects search strategies that generate translations. Our results show that search works remarkably well but that models tend to spread too much probability mass over the hypothesis space. Next, we propose tools to assess…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/analyzing-uncertainty-nmt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications