Morphological Constraints for Phrase Pivot Statistical Machine Translation
Ahmed El Kholy, Nizar Habash

TL;DR
This paper explores the use of morphological constraints to enhance phrase pivot statistical machine translation, addressing issues caused by poor morphology languages as pivots, and demonstrates improved translation quality.
Contribution
It introduces synchronous morphology constraint features, comparing handcrafted and learned constraints, to improve pivot SMT quality using limited parallel data.
Findings
Achieved 1.5 BLEU point improvement over baseline
Achieved 0.8 BLEU point improvement over direct model
Positive results on Hebrew-Arabic translation with English pivot
Abstract
The lack of parallel data for many language pairs is an important challenge to statistical machine translation (SMT). One common solution is to pivot through a third language for which there exist parallel corpora with the source and target languages. Although pivoting is a robust technique, it introduces some low quality translations especially when a poor morphology language is used as the pivot between rich morphology languages. In this paper, we examine the use of synchronous morphology constraint features to improve the quality of phrase pivot SMT. We compare hand-crafted constraints to those learned from limited parallel data between source and target languages. The learned morphology constraints are based on projected align- ments between the source and target phrases in the pivot phrase table. We show positive results on Hebrew-Arabic SMT (pivoting on English). We get 1.5 BLEU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Biomedical Text Mining and Ontologies
