Exploring Large Language Models for Translating Romanian Computational Problems into English
Adrian Marius Dumitran, Adrian-Catalin Badea, Stefan-Gabriel Muscalu, Angela-Liliana Dumitran, Stefan-Cosmin Dascalescu, Radu-Sebastian Amarie

TL;DR
This paper investigates the effectiveness of large language models in translating Romanian computational problems into English, demonstrating that with proper prompts and supervision, they can reliably support multilingual problem-solving and educational applications.
Contribution
The study evaluates multiple LLMs for translating Romanian computational tasks, introduces an augmented dataset with English translations, and compares LLM performance against human translators.
Findings
LLMs can maintain or improve translation accuracy with well-structured prompts.
Augmented datasets enhance LLM training and evaluation for multilingual tasks.
Human oversight is crucial for ensuring translation quality in critical applications.
Abstract
Recent studies have suggested that large language models (LLMs) underperform on mathematical and computer science tasks when these problems are translated from Romanian into English, compared to their original Romanian format. Accurate translation is critical for applications ranging from automatic translations in programming competitions to the creation of high-quality educational materials, as well as minimizing errors or fraud in human translations. This study shows that robust large language models (LLMs) can maintain or even enhance their performance in translating less common languages when given well-structured prompts. Our findings suggest that LLMs, with appropriate supervision, can be reliably used for the automatic translation of IOI (International Olympiad in Informatics)-style tasks. We evaluate several translation methods across multiple LLMs, including OpenRoLLM, Llama…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsLLaMA
