Neural Machine Translation with Extended Context
J\"org Tiedemann, Yves Scherrer

TL;DR
This paper explores the impact of extended context in neural machine translation, showing that larger context segments can improve coherence and are robust without degrading translation quality.
Contribution
It introduces methods for incorporating extended source and bilingual context in attention-based neural machine translation models.
Findings
Models learn to differentiate information from multiple segments.
Extended context improves textual coherence in some cases.
Models remain robust with respect to translation quality.
Abstract
We investigate the use of extended context in attention-based neural machine translation. We base our experiments on translated movie subtitles and discuss the effect of increasing the segments beyond single translation units. We study the use of extended source language context as well as bilingual context extensions. The models learn to distinguish between information from different segments and are surprisingly robust with respect to translation quality. In this pilot study, we observe interesting cross-sentential attention patterns that improve textual coherence in translation at least in some selected cases.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
