Detecting and Understanding Generalization Barriers for Neural Machine   Translation

Guanlin Li; Lemao Liu; Conghui Zhu; Tiejun Zhao; Shuming Shi

arXiv:2004.02181·cs.CL·April 7, 2020·1 cites

Detecting and Understanding Generalization Barriers for Neural Machine Translation

Guanlin Li, Lemao Liu, Conghui Zhu, Tiejun Zhao, Shuming Shi

PDF

Open Access

TL;DR

This paper investigates the specific words in input sentences that hinder neural machine translation generalization, proposing methods to detect these barriers and analyze their impact on translation quality.

Contribution

It introduces a formal definition of generalization barrier words, a tractable detection method, and comprehensive analysis of their effects in neural machine translation.

Findings

01

Identified words that cause translation degradation

02

Proposed effective barrier detection methods

03

Analyzed barrier impact on translation performance

Abstract

Generalization to unseen instances is our eternal pursuit for all data-driven models. However, for realistic task like machine translation, the traditional approach measuring generalization in an average sense provides poor understanding for the fine-grained generalization ability. As a remedy, this paper attempts to identify and understand generalization barrier words within an unseen input sentence that \textit{cause} the degradation of fine-grained generalization. We propose a principled definition of generalization barrier words and a modified version which is tractable in computation. Based on the modified one, we propose three simple methods for barrier detection by the search-aware risk estimation through counterfactual generation. We then conduct extensive analyses on those detected generalization barrier words on both Zh $\Leftrightarrow$ En NIST benchmarks from various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research