An Investigation of the Sampling-Based Alignment Method and Its Contributions

Juan Luo; Yves Lepage

arXiv:1308.4479·cs.CL·August 22, 2013

TL;DR

This paper enhances a sampling-based alignment method for phrase translation tables by enforcing n-gram alignments and adjusting their distribution, leading to improved translation quality in statistical machine translation.

Contribution

It introduces a novel approach to increase n-gram alignments using distribution adjustments and compares merged translation tables for better translation performance.

Findings

01

Increased number of n-gram alignments improves translation quality.

02

Distribution adjustment leads to better evaluation results.

03

Merging tables from different methods enhances translation accuracy.

Abstract

By investigating the distribution of phrase pairs in phrase translation tables, this paper describes an approach to increase the number of n-gram alignments in phrase translation tables output by a sampling-based alignment method. The approach consists in enforcing the alignment of n-grams in distinct translation subtables so as to increase the number of n-grams. A standard normal distribution is used to allot alignment time among translation subtables, which adjusts the distribution of n-grams. This leads to better evaluation results on statistical machine translation tasks than the original sampling-based alignment approach. Furthermore, the translation quality obtained by merging phrase translation tables computed from the sampling-based alignment method and from MGIZA++ is examined.
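
The time-allotment idea in the abstract can be illustrated with a small sketch. The abstract does not give the exact formula, so everything below is an assumption made for illustration: subtables are indexed by (source n-gram length, target n-gram length), each subtable is weighted by the standard normal density of the length difference, and the weights are normalized to share a total time budget. The names `standard_normal_pdf` and `allot_time` are hypothetical and do not come from the paper or from any alignment tool.

```python
import math
from typing import Dict, Tuple


def standard_normal_pdf(x: float) -> float:
    """Density of the standard normal distribution N(0, 1) at x."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def allot_time(total_seconds: float, max_len: int) -> Dict[Tuple[int, int], float]:
    """Split a time budget among (source length, target length) subtables.

    Assumption (not stated in the abstract): the weight of a subtable is the
    standard normal density evaluated at the difference between the source
    and target n-gram lengths, so same-length pairs receive the most time.
    """
    weights = {
        (i, j): standard_normal_pdf(i - j)
        for i in range(1, max_len + 1)
        for j in range(1, max_len + 1)
    }
    norm = sum(weights.values())
    # Normalize so that the allotted times sum to the total budget.
    return {cell: total_seconds * w / norm for cell, w in weights.items()}


if __name__ == "__main__":
    budget = allot_time(total_seconds=3600.0, max_len=3)
    for (i, j), seconds in sorted(budget.items()):
        print(f"{i}-gram x {j}-gram subtable: {seconds:.1f} s")
```

Under these assumptions, subtables pairing n-grams of equal length get the largest share of the budget, and the share falls off symmetrically as the lengths diverge. The paper's actual weighting may differ.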
