Understanding and Enhancing the Use of Context for Machine Translation
Marzieh Fadaee

TL;DR
This paper investigates how neural machine translation models utilize context to better understand language nuances, especially with scarce data, and proposes methods to improve their generalization and robustness.
Contribution
It provides insights into the role of context in neural translation models and introduces augmentation techniques to enhance learning from limited data.
Findings
Context improves translation accuracy for rare words
Augmentation models enhance generalization to unseen lexical units
Understanding data influence reveals vulnerabilities in current models
Abstract
To understand and infer meaning in language, neural models have to learn complicated nuances. Discovering distinctive linguistic phenomena from data is not an easy task. For instance, lexical ambiguity is a fundamental feature of language which is challenging to learn. Even more prominently, inferring the meaning of rare and unseen lexical units is difficult with neural networks. Meaning is often determined from context. With context, languages allow meaning to be conveyed even when the specific words used are not known by the reader. To model this learning process, a system has to learn from a few instances in context and be able to generalize well to unseen cases. The learning process is hindered when training data is scarce for a task. Even with sufficient data, learning patterns for the long tail of the lexical distribution is challenging. In this thesis, we focus on understanding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
