An empirical analysis of phrase-based and neural machine translation
Hamidreza Ghader

TL;DR
This paper compares phrase-based and neural machine translation systems to understand what linguistic information they learn, focusing on phrase reordering and attention models, and analyzes how source language information is encoded.
Contribution
It provides an empirical analysis of the linguistic phenomena captured by both MT paradigms, highlighting the interpretability of models like attention and encoder representations.
Findings
Phrase reordering models are influenced by specific words within phrases.
Attention models in neural MT capture source-side syntactic and semantic information.
Encoder hidden states encode significant linguistic features.
Abstract
Two popular types of machine translation (MT) are phrase-based and neural machine translation systems. Both of these types of systems are composed of multiple complex models or layers. Each of these models and layers learns different linguistic aspects of the source language. However, for some of these models and layers, it is not clear which linguistic phenomena are learned or how this information is learned. For phrase-based MT systems, it is often clear what information is learned by each model, and the question is rather how this information is learned, especially for its phrase reordering model. For neural machine translation systems, the situation is even more complex, since for many cases it is not exactly clear what information is learned and how it is learned. To shed light on what linguistic phenomena are captured by MT systems, we analyze the behavior of important models in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
