SlangDIT: Benchmarking LLMs in Interpretative Slang Translation
Yunlong Liang, Fandong Meng, Jiaan Wang, Jie Zhou

TL;DR
This paper introduces SlangDIT, a comprehensive benchmark and model for interpretative slang translation that integrates detection, explanation, and translation tasks to improve accuracy in context-dependent slang translation.
Contribution
It presents the first joint benchmark and a deep thinking model, SlangOWL, for slang detection, explanation, and translation, enhancing LLM performance in slang understanding.
Findings
SlangOWL outperforms vanilla and fine-tuned models.
Constructed a 25k English-Chinese slang dataset.
Deep thinking improves LLM slang translation accuracy.
Abstract
The challenge of slang translation lies in capturing context-dependent semantic extensions, as slang terms often convey meanings beyond their literal interpretation. While slang detection, explanation, and translation have been studied as isolated tasks in the era of large language models (LLMs), their intrinsic interdependence remains underexplored. The main reason is lacking of a benchmark where the two tasks can be a prerequisite for the third one, which can facilitate idiomatic translation. In this paper, we introduce the interpretative slang translation task (named SlangDIT) consisting of three sub-tasks: slang detection, cross-lingual slang explanation, and slang translation within the current context, aiming to generate more accurate translation with the help of slang detection and slang explanation. To this end, we construct a SlangDIT dataset, containing over 25k…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Translation Studies and Practices
