Target-Side Context for Discriminative Models in Statistical Machine Translation
Ale\v{s} Tamchyna, Alexander Fraser, Ond\v{r}ej Bojar, Marcin, Junczys-Dowmunt

TL;DR
This paper introduces a novel target-side context model for statistical machine translation that improves translation quality by better capturing morphological coherence, efficiently integrated into decoding, and scalable to large datasets.
Contribution
It extends discriminative translation models by incorporating target context, enhancing translation quality and morphological coherence in a scalable, efficient manner.
Findings
Consistent improvements across four language pairs.
Efficient integration into decoding process.
Enhanced morphological coherence capture.
Abstract
Discriminative translation models utilizing source context have been shown to help statistical machine translation performance. We propose a novel extension of this work using target context information. Surprisingly, we show that this model can be efficiently integrated directly in the decoding process. Our approach scales to large training data sizes and results in consistent improvements in translation quality on four language pairs. We also provide an analysis comparing the strengths of the baseline source-context model with our extended source-context and target-context model and we show that our extension allows us to better capture morphological coherence. Our work is freely available as part of Moses.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
