Overcoming the Curse of Sentence Length for Neural Machine Translation   using Automatic Segmentation

Jean Pouget-Abadie; Dzmitry Bahdanau; Bart van Merrienboer and; Kyunghyun Cho; Yoshua Bengio

arXiv:1409.1257·cs.CL·October 8, 2014

Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation

Jean Pouget-Abadie, Dzmitry Bahdanau, Bart van Merrienboer and, Kyunghyun Cho, Yoshua Bengio

PDF

TL;DR

This paper introduces an automatic segmentation method to break long sentences into manageable parts, significantly improving neural machine translation quality for lengthy inputs.

Contribution

It proposes an automatic segmentation approach to mitigate the impact of sentence length on neural translation quality, enhancing performance on long sentences.

Findings

01

Significant improvement in translation quality for long sentences

02

Effective segmentation reduces neural translation errors on lengthy inputs

03

Method outperforms baseline models on long sentence translation tasks

Abstract

The authors of (Cho et al., 2014a) have shown that the recently introduced neural network translation systems suffer from a significant drop in translation quality when translating long sentences, unlike existing phrase-based translation systems. In this paper, we propose a way to address this issue by automatically segmenting an input sentence into phrases that can be easily translated by the neural network translation model. Once each segment has been independently translated by the neural machine translation model, the translated clauses are concatenated to form a final translation. Empirical results show a significant improvement in translation quality for long sentences.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.