A Multifaceted Evaluation of Neural versus Phrase-Based Machine   Translation for 9 Language Directions

Antonio Toral; V\'ictor M. S\'anchez-Cartagena

arXiv:1701.02901·cs.CL·January 12, 2017·2 cites

A Multifaceted Evaluation of Neural versus Phrase-Based Machine Translation for 9 Language Directions

Antonio Toral, V\'ictor M. S\'anchez-Cartagena

PDF

Open Access 1 Repo

TL;DR

This paper compares neural and phrase-based machine translation across nine language pairs, revealing neural methods produce more fluent, accurate, and diverse translations but struggle with very long sentences.

Contribution

It provides a comprehensive, multi-dimensional evaluation of neural versus phrase-based translation systems across multiple languages and metrics.

Findings

01

Neural translation outputs are more fluent and accurate in word order.

02

Neural systems better handle inflected forms.

03

Neural translation struggles with very long sentences.

Abstract

We aim to shed light on the strengths and weaknesses of the newly introduced neural machine translation paradigm. To that end, we conduct a multifaceted evaluation in which we compare outputs produced by state-of-the-art neural machine translation and phrase-based machine translation systems for 9 language directions across a number of dimensions. Specifically, we measure the similarity of the outputs, their fluency and amount of reordering, the effect of sentence length and performance across different error categories. We find out that translations produced by neural machine translation systems are considerably different, more fluent and more accurate in terms of word order compared to those produced by phrase-based systems. Neural machine translation systems are also more accurate at producing inflected forms, but they perform poorly when translating very long sentences.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

antot/neural_vs_-phrasebased_smt_eacl17
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications