A Survey of Word Reordering in Statistical Machine Translation:   Computational Models and Language Phenomena

Arianna Bisazza; Marcello Federico

arXiv:1502.04938·cs.CL·February 27, 2017

A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena

Arianna Bisazza, Marcello Federico

PDF

TL;DR

This survey reviews the challenges and methods of word reordering in statistical machine translation, emphasizing the importance of linguistic phenomena and empirical analysis for improving translation quality across language pairs.

Contribution

It provides a comprehensive overview of reordering models in SMT, analyzes linguistic factors affecting reordering, and offers insights for selecting suitable models based on language-specific phenomena.

Findings

01

Certain linguistic facts can predict reordering behavior.

02

Different reordering models perform variably across language pairs.

03

Understanding language-specific phenomena improves SMT reordering strategies.

Abstract

Word reordering is one of the most difficult aspects of statistical machine translation (SMT), and an important factor of its quality and efficiency. Despite the vast amount of research published to date, the interest of the community in this problem has not decreased, and no single method appears to be strongly dominant across language pairs. Instead, the choice of the optimal approach for a new translation task still seems to be mostly driven by empirical trials. To orientate the reader in this vast and complex research area, we present a comprehensive survey of word reordering viewed as a statistical modeling challenge and as a natural language phenomenon. The survey describes in detail how word reordering is modeled within different string-based and tree-based SMT frameworks and as a stand-alone task, including systematic overviews of the literature in advanced reordering modeling.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.