Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese

Yikang Liu; Wanyang Zhang; Yiming Wang; Jialong Tang; Pei Zhang; Baosong Yang; Fei Huang; Rui Wang; Hai Hu

arXiv:2507.12260·cs.CL·September 22, 2025

Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese

Yikang Liu, Wanyang Zhang, Yiming Wang, Jialong Tang, Pei Zhang, Baosong Yang, Fei Huang, Rui Wang, Hai Hu

PDF

Open Access 1 Video

TL;DR

This paper introduces the translationese-index (T-index), a novel measure based on likelihood ratios of fine-tuned language models, to grade translationese in texts, demonstrating its effectiveness and generalizability across domains and its independence from existing MT quality metrics.

Contribution

The paper proposes the first graded measure for translationese, the T-index, using contrastively fine-tuned language models, and validates its effectiveness across diverse datasets and in relation to human judgments.

Findings

01

T-index generalizes across genres, authors, and language pairs.

02

T-index correlates well with human judgments of translation quality.

03

T-index provides a complementary metric to existing MT quality estimation methods.

Abstract

Translationese refers to linguistic properties that usually occur in translated texts. Previous works study translationese by framing it as a binary classification between original texts and translated texts. In this paper, we argue that translationese should be graded instead of binary and propose the first measure for translationese -- the translationese-index (T-index), computed from the likelihood ratios of two contrastively fine-tuned language models (LMs). We use synthesized translations and translations in the wild to evaluate T-index's generalizability in cross-domain settings and its validity against human judgments. Our results show that T-index can generalize to unseen genres, authors, and language pairs. Moreover, T-index computed using two 0.5B LMs fine-tuned on only 1-5k pairs of synthetic data can effectively capture translationese, as demonstrated by alignment with human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese· underline

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification · Authorship Attribution and Profiling