Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model

Minghan Wang; Viet-Thanh Pham; Farhad Moghimifar; Thuy-Trang Vu

arXiv:2501.11953·cs.CL·January 22, 2025

TL;DR

This paper evaluates how well large language models and neural machine translation systems translate culturally rooted proverbs. It finds that the models translate well between languages with similar cultural backgrounds, and that current automatic evaluation metrics are unreliable for this task.

Contribution

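The paper introduces a proverb translation dataset covering standalone proverbs and proverbs in conversation for four language pairs, and uses it to compare LLMs with NMT models. LLMs generally perform better, and existing automatic metrics (BLEU, CHRF++, COMET) prove inadequate for judging proverb translation quality.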

Findings

01. LLMs generally outperform NMT models in proverb translation.

02. Models translate better between languages with similar cultural backgrounds.

03. Current automatic metrics (BLEU, CHRF++, COMET) are unreliable for assessing proverb translation quality.
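
One plausible reason surface-overlap metrics can struggle with proverbs is that a culturally equivalent saying may share no words with the reference. The sketch below illustrates this with a toy word-overlap F1 score. It is an assumption-laden illustration, not BLEU, CHRF++, or COMET, and not the paper's evaluation code. The function names and the example sentences are hypothetical and do not come from the paper's dataset.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count the word n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def overlap_f1(candidate, reference, n=1):
    """Toy word n-gram overlap F1 (hypothetical helper, not a standard metric)."""
    cand = ngram_counts(candidate.lower().split(), n)
    ref = ngram_counts(reference.lower().split(), n)
    # Clipped matches: each n-gram counts at most as often as it appears in both.
    matched = sum((cand & ref).values())
    if matched == 0:
        return 0.0
    precision = matched / sum(cand.values())
    recall = matched / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Illustrative sentences only (assumed for this sketch).
reference = "birds of a feather flock together"
near_copy = "birds of a feather fly together"   # close wording, one word changed
equivalent = "like attracts like"                # similar meaning, no shared words

print(overlap_f1(near_copy, reference))   # high score from surface overlap
print(overlap_f1(equivalent, reference))  # zero despite similar meaning
```

In this toy setting the near copy scores far above the equivalent saying, even though a human reader might accept both. Real metrics are more sophisticated, so this only hints at the kind of mismatch the findings describe.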

Abstract

Despite achieving remarkable performance, machine translation (MT) research remains underexplored in terms of translating cultural elements in languages, such as idioms, proverbs, and colloquial expressions. This paper investigates the capability of state-of-the-art neural machine translation (NMT) and large language models (LLMs) in translating proverbs, which are deeply rooted in cultural contexts. We construct a translation dataset of standalone proverbs and proverbs in conversation for four language pairs. Our experiments show that the studied models can achieve good translation between languages with similar cultural backgrounds, and LLMs generally outperform NMT models in proverb translation. Furthermore, we find that current automatic evaluation metrics such as BLEU, CHRF++ and COMET are inadequate for reliably assessing the quality of proverb translation, highlighting the need…

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling

Methods: Attentive Walk-Aggregating Graph Neural Network