Measuring and Increasing Context Usage in Context-Aware Machine   Translation

Patrick Fernandes; Kayo Yin; Graham Neubig; Andr\'e F. T. Martins

arXiv:2105.03482·cs.CL·June 3, 2021

Measuring and Increasing Context Usage in Context-Aware Machine Translation

Patrick Fernandes, Kayo Yin, Graham Neubig, Andr\'e F. T. Martins

PDF

1 Repo

TL;DR

This paper introduces a metric to measure context utilization in neural machine translation, and proposes a training method to enhance context usage, resulting in improved translation quality and coherence.

Contribution

It presents a new metric for quantifying context usage and a simple training technique to increase context utilization in document-level translation models.

Findings

01

Target context is referenced more than source context.

02

Longer context conditioning has diminishing returns.

03

Enhanced context usage improves translation quality and coherence.

Abstract

Recent work in neural machine translation has demonstrated both the necessity and feasibility of using inter-sentential context -- context from sentences other than those currently being translated. However, while many current methods present model architectures that theoretically can use this extra context, it is often not clear how much they do actually utilize it at translation time. In this paper, we introduce a new metric, conditional cross-mutual information, to quantify the usage of context by these models. Using this metric, we measure how much document-level machine translation systems use particular varieties of context. We find that target context is referenced more than source context, and that conditioning on a longer context has a diminishing effect on results. We then introduce a new, simple training method, context-aware word dropout, to increase the usage of context by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neulab/contextual-mt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.