Machine Translation Testing via Syntactic Tree Pruning
Quanjun Zhang, Juan Zhai, Chunrong Fang, Jiawei Liu, Weisong Sun,, Haichuan Hu, Qingyu Wang

TL;DR
This paper introduces a novel syntactic tree pruning metamorphic testing method to validate machine translation systems, significantly improving error detection accuracy over existing techniques.
Contribution
It proposes a semantics-preserving pruning strategy and a bag-of-words model for detecting translation errors, enhancing testing effectiveness for neural machine translation systems.
Findings
Detected over 10,000 erroneous translations in two popular systems
Achieved 64.5% and 65.4% precision in error identification
Found more errors than state-of-the-art techniques, with higher recall
Abstract
Machine translation systems have been widely adopted in our daily life, making life easier and more convenient. Unfortunately, erroneous translations may result in severe consequences, such as financial losses. This requires to improve the accuracy and the reliability of machine translation systems. However, it is challenging to test machine translation systems because of the complexity and intractability of the underlying neural models. To tackle these challenges, we propose a novel metamorphic testing approach by syntactic tree pruning (STP) to validate machine translation systems. Our key insight is that a pruned sentence should have similar crucial semantics compared with the original sentence. Specifically, STP (1) proposes a core semantics-preserving pruning strategy by basic sentence structure and dependency relations on the level of syntactic tree representation; (2) generates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Software Engineering Research
MethodsPruning
