Coverage Embedding Models for Neural Machine Translation

Haitao Mi; Baskaran Sankaran; Zhiguo Wang; Abe Ittycheriah

arXiv:1605.03148·cs.CL·August 30, 2016·35 cites

Coverage Embedding Models for Neural Machine Translation

Haitao Mi, Baskaran Sankaran, Zhiguo Wang, Abe Ittycheriah

PDF

Open Access

TL;DR

This paper introduces coverage embedding models for neural machine translation to improve translation accuracy by explicitly tracking source word coverage, reducing errors like repetition and omission.

Contribution

It proposes a novel coverage embedding approach that dynamically updates coverage information during translation, enhancing NMT performance.

Findings

01

Significant improvement in translation quality on Chinese-English tasks

02

Effective reduction of repeated and dropped translations

03

Enhanced model outperforms baseline NMT systems

Abstract

In this paper, we enhance the attention-based neural machine translation (NMT) by adding explicit coverage embedding models to alleviate issues of repeating and dropping translations in NMT. For each source word, our model starts with a full coverage embedding vector to track the coverage status, and then keeps updating it with neural networks as the translation goes. Experiments on the large-scale Chinese-to-English task show that our enhanced model improves the translation quality significantly on various test sets over the strong large vocabulary NMT system.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications