Modeling Target-Side Inflection in Neural Machine Translation

Ale\v{s} Tamchyna; Marion Weller-Di Marco; Alexander Fraser

arXiv:1707.06012·cs.CL·September 6, 2017

Modeling Target-Side Inflection in Neural Machine Translation

Ale\v{s} Tamchyna, Marion Weller-Di Marco, Alexander Fraser

PDF

TL;DR

This paper proposes a method for neural machine translation that improves handling of morphologically rich languages by predicting lemmas and POS tags, leading to better translation quality.

Contribution

It introduces a lemma and POS tagging approach for NMT that enhances generalization over complex morphological systems.

Findings

01

Improved translation quality for Czech and German.

02

The approach outperforms standard BPE-based methods.

03

Explicit morphological information alone does not explain the improvements.

Abstract

NMT systems have problems with large vocabulary sizes. Byte-pair encoding (BPE) is a popular approach to solving this problem, but while BPE allows the system to generate any target-side word, it does not enable effective generalization over the rich vocabulary in morphologically rich languages with strong inflectional phenomena. We introduce a simple approach to overcome this problem by training a system to produce the lemma of a word and its morphologically rich POS tag, which is then followed by a deterministic generation step. We apply this strategy for English-Czech and English-German translation scenarios, obtaining improvements in both settings. We furthermore show that the improvement is not due to only adding explicit morphological information.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsByte Pair Encoding