Handling Homographs in Neural Machine Translation

Frederick Liu; Han Lu; Graham Neubig

arXiv:1708.06510·cs.CL·March 29, 2018·1 cites

Handling Homographs in Neural Machine Translation

Frederick Liu, Han Lu, Graham Neubig

PDF

Open Access

TL;DR

This paper investigates the persistent challenge of translating homographs in neural machine translation systems, showing that current models still struggle with ambiguity and proposing context-aware embeddings to improve translation accuracy.

Contribution

The paper introduces context-aware word embeddings inspired by word sense disambiguation to enhance NMT's handling of homographs, demonstrating improved translation performance.

Findings

01

Existing NMT systems still struggle with homographs.

02

Context-aware embeddings improve BLEU scores.

03

Enhanced models better translate ambiguous words.

Abstract

Homographs, words with different meanings but the same surface form, have long caused difficulty for machine translation systems, as it is difficult to select the correct translation based on the context. However, with the advent of neural machine translation (NMT) systems, which can theoretically take into account global sentential context, one may hypothesize that this problem has been alleviated. In this paper, we first provide empirical evidence that existing NMT systems in fact still have significant problems in properly translating ambiguous words. We then proceed to describe methods, inspired by the word sense disambiguation literature, that model the context of the input word with context-aware word embeddings that help to differentiate the word sense be- fore feeding it into the encoder. Experiments on three language pairs demonstrate that such models improve the performance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications