Context-Aware Monolingual Repair for Neural Machine Translation

Elena Voita; Rico Sennrich; Ivan Titov

arXiv:1909.01383 · cs.CL · October 16, 2019 · 1 citation

TL;DR

DocRepair is a monolingual post-editing model that makes sentence-level translations consistent with each other in context. It is trained only on document-level data in the target language, and the paper reports large gains on contextual translation phenomena, higher BLEU scores for English-Russian, and a human preference for the corrected translations.

Contribution

The paper presents a monolingual sequence-to-sequence model that post-edits sentence-level translations for contextual consistency and is trained solely on target-language data.

Findings

01 Large improvements in contextual translation phenomena

02 Enhanced BLEU scores for English-Russian translation

03 Human evaluators prefer corrected translations

Abstract

Modern sentence-level NMT systems often produce plausible translations of isolated sentences. However, when put in context, these translations may end up being inconsistent with each other. We propose a monolingual DocRepair model to correct inconsistencies between sentence-level translations. DocRepair performs automatic post-editing on a sequence of sentence-level translations, refining translations of sentences in context of each other. For training, the DocRepair model requires only monolingual document-level data in the target language. It is trained as a monolingual sequence-to-sequence model that maps inconsistent groups of sentences into consistent ones. The consistent groups come from the original training data; the inconsistent groups are obtained by sampling round-trip translations for each isolated sentence. We show that this approach successfully imitates inconsistencies we…
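
The training-data construction described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the separator token `SEP`, the helper `make_training_pair`, and the stand-in `toy_round_trip` are all hypothetical names, and the toy function only mimics the effect of sampling a round-trip translation by randomly swapping a few words. A real pipeline would call an actual target-to-source and source-to-target translation system in its place.

```python
import random
from typing import Callable, List, Tuple

# Hypothetical separator placed between sentences of one group (assumption).
SEP = " _eos "


def make_training_pair(
    group: List[str],
    round_trip: Callable[[str], str],
) -> Tuple[str, str]:
    """Build one (inconsistent source, consistent target) training pair."""
    # Each sentence is round-tripped in isolation, so the sampler never sees
    # its neighbours; this is what makes the resulting group inconsistent.
    noisy = [round_trip(sentence) for sentence in group]
    # The target side is the original, consistent group of sentences.
    return SEP.join(noisy), SEP.join(group)


def toy_round_trip(sentence: str, rng: random.Random) -> str:
    """Stand-in for a sampled round-trip translation (not a real MT system)."""
    # Hypothetical confusions, loosely imitating context-dependent choices.
    swaps = {"she": ["she", "he", "it"], "you": ["you", "thou"]}
    words = [rng.choice(swaps.get(word, [word])) for word in sentence.split()]
    return " ".join(words)


rng = random.Random(0)
group = ["she left early", "did you see where she went"]
src, tgt = make_training_pair(group, lambda s: toy_round_trip(s, rng))
```

Here `src` would serve as the input to the monolingual sequence-to-sequence model and `tgt` as the output it is trained to reproduce. Only target-language text is needed, since both sides of the pair are derived from the same monolingual document.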

Code & Models

Repositories

lena-voita/good-translation-wrong-in-context
tf · Official

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification