Lexically Cohesive Neural Machine Translation with Copy Mechanism

Vipul Mishra; Chenhui Chu; Yuki Arase

arXiv:2010.05193·cs.CL·October 13, 2020

Lexically Cohesive Neural Machine Translation with Copy Mechanism

Vipul Mishra, Chenhui Chu, Yuki Arase

PDF

Open Access

TL;DR

This paper introduces a copy mechanism in neural machine translation to explicitly enhance lexical cohesion across document translations, leading to more consistent word choices.

Contribution

It presents a novel explicit approach for lexical cohesion in neural translation models by integrating a copy mechanism, improving over previous implicit methods.

Findings

01

Significant improvement in lexical cohesion for Japanese-English translation

02

Model outperforms previous context-aware neural translation models

03

Demonstrates effectiveness of explicit copying for discourse consistency

Abstract

Lexically cohesive translations preserve consistency in word choices in document-level translation. We employ a copy mechanism into a context-aware neural machine translation model to allow copying words from previous translation outputs. Different from previous context-aware neural machine translation models that handle all the discourse phenomena implicitly, our model explicitly addresses the lexical cohesion problem by boosting the probabilities to output words consistently. We conduct experiments on Japanese to English translation using an evaluation dataset for discourse translation. The results showed that the proposed model significantly improved lexical cohesion compared to previous context-aware models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications