Evaluating OpenAI GPT Models for Translation of Endangered Uralic Languages: A Comparison of Reasoning and Non-Reasoning Architectures
Yehor Tereshchenko, Mika H\"am\"al\"ainen, Svitlana Myroniuk

TL;DR
This paper compares reasoning and non-reasoning GPT models in translating endangered Uralic languages, revealing that reasoning models have lower refusal rates and potentially better performance for language preservation efforts.
Contribution
It provides a novel comparison of GPT architectures for low-resource language translation, highlighting reasoning models' advantages in refusal rates and translation quality.
Findings
Reasoning models show 16% lower refusal rates.
Significant performance differences between model types.
Insights for endangered language translation and preservation.
Abstract
The evaluation of Large Language Models (LLMs) for translation tasks has primarily focused on high-resource languages, leaving a significant gap in understanding their performance on low-resource and endangered languages. This study presents a comprehensive comparison of OpenAI's GPT models, specifically examining the differences between reasoning and non-reasoning architectures for translating between Finnish and four low-resource Uralic languages: Komi-Zyrian, Moksha, Erzya, and Udmurt. Using a parallel corpus of literary texts, we evaluate model willingness to attempt translation through refusal rate analysis across different model architectures. Our findings reveal significant performance variations between reasoning and non-reasoning models, with reasoning models showing 16 percentage points lower refusal rates. The results provide valuable insights for researchers and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Language and cultural evolution · Topic Modeling
