Fill in the Blanks: Imputing Missing Sentences for Larger-Context Neural Machine Translation
S\'ebastien Jean, Ankur Bapna, Orhan Firat

TL;DR
This paper explores methods to fill in missing context for larger-context neural machine translation, improving translation coherence and capturing long-range phenomena by generating or copying context, and validating the use of back-translation.
Contribution
It introduces and evaluates three approaches for imputing missing context in document-level translation, demonstrating their impact on translation quality and linguistic coherence.
Findings
Copy heuristic improves lexical coherence
Random contexts degrade performance on long-distance phenomena
Back-translation enhances BLEU scores and long-range modeling
Abstract
Most neural machine translation systems still translate sentences in isolation. To make further progress, a promising line of research additionally considers the surrounding context in order to provide the model potentially missing source-side information, as well as to maintain a coherent output. One difficulty in training such larger-context (i.e. document-level) machine translation systems is that context may be missing from many parallel examples. To circumvent this issue, two-stage approaches, in which sentence-level translations are post-edited in context, have recently been proposed. In this paper, we instead consider the viability of filling in the missing context. In particular, we consider three distinct approaches to generate the missing context: using random contexts, applying a copy heuristic or generating it with a language model. In particular, the copy heuristic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
