Evaluating Large Language Models on Urdu Idiom Translation

Muhammad Farmal Khan; Mousumi Akter

arXiv:2510.17460·cs.CL·October 21, 2025

Open Access

TL;DR

This paper introduces the first evaluation datasets for Urdu-to-English idiomatic translation, evaluates multiple open-source LLMs and NMT systems on them, and finds that prompt engineering and script choice (Native Urdu vs. Roman Urdu) significantly influence translation quality.

Contribution

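The paper provides the first evaluation datasets for Urdu-to-English idiomatic translation, covering both Native Urdu and Roman Urdu scripts with gold-standard English equivalents, and measures how prompt engineering and script choice affect translation performance.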

Findings

01

Prompt engineering improves idiomatic translation over direct translation, though differences among prompt types are relatively minor.

02

Native Urdu script yields better translations than Roman Urdu.

03

Translation performance varies with text representation.
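
The contrast between direct translation and prompt-guided translation, across the two scripts, can be pictured with a small sketch. The prompt wording, the function name build_prompt, and the placeholder sentence below are all hypothetical; the paper's actual prompts are not reproduced on this page.

```python
def build_prompt(text: str, script: str, idiom_aware: bool) -> str:
    """Return a translation prompt (hypothetical wording, not the paper's)."""
    labels = {"native": "Native Urdu script", "roman": "Roman Urdu"}
    if script not in labels:
        raise ValueError("script must be 'native' or 'roman'")
    # Direct translation: a bare instruction with no guidance about idioms.
    header = f"Translate the following sentence from Urdu ({labels[script]}) to English."
    if idiom_aware:
        # Prompt-engineered variant: ask for the figurative meaning.
        header += (
            " The sentence may contain an idiom. Translate the idiom by its"
            " figurative meaning, using an equivalent English expression where"
            " one exists, rather than word for word."
        )
    return f"{header}\n\nUrdu: {text}\nEnglish:"


# Placeholder input; substitute a real sentence in either script.
sentence = "<Urdu sentence here>"
direct = build_prompt(sentence, "roman", idiom_aware=False)
guided = build_prompt(sentence, "roman", idiom_aware=True)
print(direct)
print(guided)
```

Keeping the script label and the idiom guidance as separate arguments makes it possible to vary one factor at a time, which is the kind of comparison the findings describe.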

Abstract

Idiomatic translation remains a significant challenge in machine translation, especially for low-resource languages such as Urdu, and has received limited prior attention. To advance research in this area, we introduce the first evaluation datasets for Urdu-to-English idiomatic translation, covering both Native Urdu and Roman Urdu scripts and annotated with gold-standard English equivalents. We evaluate multiple open-source Large Language Models (LLMs) and Neural Machine Translation (NMT) systems on this task, focusing on their ability to preserve idiomatic and cultural meaning. Automatic metrics including BLEU, BERTScore, COMET, and XCOMET are used to assess translation quality. Our findings indicate that prompt engineering enhances idiomatic translation compared to direct translation, though performance differences among prompt types are relatively minor. Moreover, cross script…
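
To illustrate why surface-overlap metrics such as BLEU are sensitive to idiom handling, here is a minimal, simplified sentence-level BLEU in plain Python. It is a sketch under stated assumptions: a single reference, n-grams up to 2 (standard BLEU uses up to 4), no smoothing, and whitespace tokenization. It is not the authors' evaluation code, and the example strings are made up for illustration rather than taken from the datasets.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it occurs in the reference."""
    cand = ngrams(candidate, n)
    ref = ngrams(reference, n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
    return clipped / total


def simple_bleu(candidate, reference, max_n=2):
    """Simplified single-reference BLEU: geometric mean of clipped n-gram
    precisions (n = 1..max_n) multiplied by a brevity penalty."""
    cand, ref = candidate.lower().split(), reference.lower().split()
    if not cand:
        return 0.0
    precisions = [modified_precision(cand, ref, n) for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0  # no smoothing in this sketch
    log_mean = sum(math.log(p) for p in precisions) / max_n
    # Penalize candidates shorter than the reference.
    bp = 1.0 if len(cand) >= len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * math.exp(log_mean)


# Made-up strings for illustration only; not from the paper's datasets.
gold = "he was overjoyed"
idiomatic = "he was overjoyed"
literal = "his heart became a garden"
partial = "he was very happy"

print(simple_bleu(idiomatic, gold))
print(simple_bleu(literal, gold))
print(simple_bleu(partial, gold))
```

In this toy setup a word-for-word rendering shares no n-grams with the gold equivalent and scores zero, while a correct paraphrase is only partially rewarded. That gap is one reason overlap metrics are commonly paired with embedding-based or learned metrics such as BERTScore, COMET, and XCOMET.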

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Translation Studies and Practices