On the Integration of LinguisticFeatures into Statistical and Neural Machine Translation
Eva Vanmassenhove

TL;DR
This paper analyzes the integration of linguistic features into statistical and neural machine translation, highlighting remaining challenges and proposing solutions to improve translation accuracy by addressing linguistic discrepancies.
Contribution
It identifies key linguistic issues in MT, evaluates the impact of integrating linguistic features, and discusses the drawbacks of neural MT such as algorithmic bias.
Findings
Neural MT surpasses statistical MT in many aspects.
Linguistic features can improve translation accuracy.
Algorithmic bias affects neural MT performance.
Abstract
New machine translations (MT) technologies are emerging rapidly and with them, bold claims of achieving human parity such as: (i) the results produced approach "accuracy achieved by average bilingual human translators" (Wu et al., 2017b) or (ii) the "translation quality is at human parity when compared to professional human translators" (Hassan et al., 2018) have seen the light of day (Laubli et al., 2018). Aside from the fact that many of these papers craft their own definition of human parity, these sensational claims are often not supported by a complete analysis of all aspects involved in translation. Establishing the discrepancies between the strengths of statistical approaches to MT and the way humans translate has been the starting point of our research. By looking at MT output and linguistic theory, we were able to identify some remaining issues. The problems range from simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
