Understanding and Enhancing the Use of Context for Machine Translation

Marzieh Fadaee

arXiv:2102.10437·cs.CL·February 23, 2021

Understanding and Enhancing the Use of Context for Machine Translation

Marzieh Fadaee

PDF

Open Access

TL;DR

This paper investigates how neural machine translation models utilize context to better understand language nuances, especially with scarce data, and proposes methods to improve their generalization and robustness.

Contribution

It provides insights into the role of context in neural translation models and introduces augmentation techniques to enhance learning from limited data.

Findings

01

Context improves translation accuracy for rare words

02

Augmentation models enhance generalization to unseen lexical units

03

Understanding data influence reveals vulnerabilities in current models

Abstract

To understand and infer meaning in language, neural models have to learn complicated nuances. Discovering distinctive linguistic phenomena from data is not an easy task. For instance, lexical ambiguity is a fundamental feature of language which is challenging to learn. Even more prominently, inferring the meaning of rare and unseen lexical units is difficult with neural networks. Meaning is often determined from context. With context, languages allow meaning to be conveyed even when the specific words used are not known by the reader. To model this learning process, a system has to learn from a few instances in context and be able to generalize well to unseen cases. The learning process is hindered when training data is scarce for a task. Even with sufficient data, learning patterns for the long tail of the lexical distribution is challenging. In this thesis, we focus on understanding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis