RepoTransBench: A Real-World Multilingual Benchmark for Repository-Level Code Translation

Yanli Wang; Yanlin Wang; Suiquan Wang; Daya Guo; Jiachi Chen; John Grundy; Xilin Liu; Yuchi Ma; Mingzhi Mao; Hongyu Zhang; Zibin Zheng

arXiv:2412.17744·cs.SE·December 17, 2025

RepoTransBench: A Real-World Multilingual Benchmark for Repository-Level Code Translation

Yanli Wang, Yanlin Wang, Suiquan Wang, Daya Guo, Jiachi Chen, John Grundy, Xilin Liu, Yuchi Ma, Mingzhi Mao, Hongyu Zhang, Zibin Zheng

PDF

Open Access

TL;DR

RepoTransBench is a comprehensive, real-world benchmark for multilingual repository-level code translation, highlighting current challenges and providing a framework to evaluate and improve large language models' performance on complex, large-scale code translation tasks.

Contribution

The paper introduces RepoTransBench, a novel real-world benchmark with 1,897 repository samples across 13 language pairs, and RepoTransAgent, a general framework for repository-level code translation.

Findings

01

Best method achieves only 32.8% success rate.

02

Translation difficulty varies by language pair direction.

03

Dynamic-to-static translation is more challenging than static-to-dynamic.

Abstract

Repository-level code translation refers to translating an entire code repository from one programming language to another while preserving the functionality of the source repository. Many benchmarks have been proposed to evaluate the performance of such code translators. However, previous benchmarks mostly provide fine-grained samples, focusing at either code snippet, function, or file-level code translation. Such benchmarks do not accurately reflect real-world demands, where entire repositories often need to be translated, involving longer code length and more complex functionalities. To address this gap, we propose a new benchmark, named RepoTransBench, which is a real-world multilingual repository-level code translation benchmark featuring 1,897 real-world repository samples across 13 language pairs with automatically executable test suites. Besides, we introduce RepoTransAgent, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Software Testing and Debugging Techniques · Software Engineering Research