Target-Side Context for Discriminative Models in Statistical Machine   Translation

Ale\v{s} Tamchyna; Alexander Fraser; Ond\v{r}ej Bojar; Marcin; Junczys-Dowmunt

arXiv:1607.01149·cs.CL·July 6, 2016

Target-Side Context for Discriminative Models in Statistical Machine Translation

Ale\v{s} Tamchyna, Alexander Fraser, Ond\v{r}ej Bojar, Marcin, Junczys-Dowmunt

PDF

TL;DR

This paper introduces a novel target-side context model for statistical machine translation that improves translation quality by better capturing morphological coherence, efficiently integrated into decoding, and scalable to large datasets.

Contribution

It extends discriminative translation models by incorporating target context, enhancing translation quality and morphological coherence in a scalable, efficient manner.

Findings

01

Consistent improvements across four language pairs.

02

Efficient integration into decoding process.

03

Enhanced morphological coherence capture.

Abstract

Discriminative translation models utilizing source context have been shown to help statistical machine translation performance. We propose a novel extension of this work using target context information. Surprisingly, we show that this model can be efficiently integrated directly in the decoding process. Our approach scales to large training data sizes and results in consistent improvements in translation quality on four language pairs. We also provide an analysis comparing the strengths of the baseline source-context model with our extended source-context and target-context model and we show that our extension allows us to better capture morphological coherence. Our work is freely available as part of Moses.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.