TL;DR
This paper evaluates how neural machine translation models handle negation translation between English, German, and Chinese, revealing improvements with deeper networks but highlighting persistent under-translation issues and complex model behaviors.
Contribution
It provides a comprehensive evaluation of negation translation in NMT, analyzes the causes of under-translation, and explores how negation information is represented within models.
Findings
NMT performance on negation improves with deeper networks.
Under-translation is the main error type in NMT for negation.
Models encode negation information well but still struggle with reliable translation.
Abstract
In this paper, we evaluate the translation of negation both automatically and manually, in English--German (EN--DE) and English--Chinese (EN--ZH). We show that the ability of neural machine translation (NMT) models to translate negation has improved with deeper and more advanced networks, although the performance varies between language pairs and translation directions. The accuracy of manual evaluation in EN-DE, DE-EN, EN-ZH, and ZH-EN is 95.7%, 94.8%, 93.4%, and 91.7%, respectively. In addition, we show that under-translation is the most significant error type in NMT, which contrasts with the more diverse error profile previously observed for statistical machine translation. To better understand the root of the under-translation of negation, we study the model's information flow and training data. While our information flow analysis does not reveal any deficiencies that could be used…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
