SlangDIT: Benchmarking LLMs in Interpretative Slang Translation

Yunlong Liang; Fandong Meng; Jiaan Wang; Jie Zhou

arXiv:2505.14181·cs.CL·May 21, 2025

SlangDIT: Benchmarking LLMs in Interpretative Slang Translation

Yunlong Liang, Fandong Meng, Jiaan Wang, Jie Zhou

PDF

Open Access

TL;DR

This paper introduces SlangDIT, a comprehensive benchmark and model for interpretative slang translation that integrates detection, explanation, and translation tasks to improve accuracy in context-dependent slang translation.

Contribution

It presents the first joint benchmark and a deep thinking model, SlangOWL, for slang detection, explanation, and translation, enhancing LLM performance in slang understanding.

Findings

01

SlangOWL outperforms vanilla and fine-tuned models.

02

Constructed a 25k English-Chinese slang dataset.

03

Deep thinking improves LLM slang translation accuracy.

Abstract

The challenge of slang translation lies in capturing context-dependent semantic extensions, as slang terms often convey meanings beyond their literal interpretation. While slang detection, explanation, and translation have been studied as isolated tasks in the era of large language models (LLMs), their intrinsic interdependence remains underexplored. The main reason is lacking of a benchmark where the two tasks can be a prerequisite for the third one, which can facilitate idiomatic translation. In this paper, we introduce the interpretative slang translation task (named SlangDIT) consisting of three sub-tasks: slang detection, cross-lingual slang explanation, and slang translation within the current context, aiming to generate more accurate translation with the help of slang detection and slang explanation. To this end, we construct a SlangDIT dataset, containing over 25k…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Translation Studies and Practices