Culture-Aware Machine Translation in Large Language Models: Benchmarking and Investigation

Zekun Yuan; Yangfan Ye; Xiaocheng Feng; Baohang Li; Qichen Hong; Yunfei Lu; Dandan Tu; Bing Qin

arXiv:2604.24361·cs.CL·April 28, 2026

Culture-Aware Machine Translation in Large Language Models: Benchmarking and Investigation

Zekun Yuan, Yangfan Ye, Xiaocheng Feng, Baohang Li, Qichen Hong, Yunfei Lu, Dandan Tu, Bing Qin

PDF

1 Repo

TL;DR

This paper introduces CanMT, a new dataset and evaluation framework for assessing culture-aware translation in large language models, revealing performance disparities and the importance of reference translations.

Contribution

It provides a novel dataset and evaluation framework specifically designed for culture-aware machine translation in LLMs, along with systematic analysis of model behaviors.

Findings

01

Significant performance differences across models in culture-aware translation.

02

Translation strategies systematically influence model behavior.

03

Reference translations improve evaluation reliability.

Abstract

Large language models (LLMs) have achieved strong performance in general machine translation, yet their ability in culture-aware scenarios remains poorly understood. To bridge this gap, we introduce CanMT, a Culture-Aware Novel-Driven Parallel Dataset for Machine Translation, together with a theoretically grounded, multi-dimensional evaluation framework for assessing cultural translation quality. Leveraging CanMT, we systematically evaluate a wide range of LLMs and translation systems under different translation strategy constraints. Our findings reveal substantial performance disparities across models and demonstrate that translation strategies exert a systematic influence on model behavior. Further analysis shows that translation difficulty varies across types of culture-specific items, and that a persistent gap remains between models' recognition of culture-specific knowledge and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.