Investigating Length Issues in Document-level Machine Translation

Ziqian Peng; Rachel Bawden; Fran\c{c}ois Yvon

arXiv:2412.17592·cs.CL·April 29, 2025

Investigating Length Issues in Document-level Machine Translation

Ziqian Peng, Rachel Bawden, Fran\c{c}ois Yvon

PDF

TL;DR

This paper examines how increasing document length impacts machine translation quality, revealing that longer texts and sentence positions within documents significantly affect translation performance, with current methods still lagging behind sentence-level translation.

Contribution

The study provides a new measurement approach for length effects in document-level MT and demonstrates the persistent challenges with longer texts and sentence positioning.

Findings

01

Translation quality decreases as input length increases.

02

Sentence position within a document influences translation accuracy.

03

Manipulating document length distribution and positional embeddings has limited effect.

Abstract

Transformer architectures are increasingly effective at processing and generating very long chunks of texts, opening new perspectives for document-level machine translation (MT). In this work, we challenge the ability of MT systems to handle texts comprising up to several thousands of tokens. We design and implement a new approach designed to precisely measure the effect of length increments on MT outputs. Our experiments with two representative architectures unambiguously show that (a)~translation performance decreases with the length of the input text; (b)~the position of sentences within the document matters, and translation quality is higher for sentences occurring earlier in a document. We further show that manipulating the distribution of document lengths and of positional embeddings only marginally mitigates such problems. Our results suggest that even though document-level MT is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.