EffiReasonTrans: RL-Optimized Reasoning for Code Translation

Yanlin Wang; Rongyi Ou; Yanli Wang; Mingwei Liu; Jiachi Chen; Ensheng Shi; Xilin Liu; Yuchi Ma; and Zibin Zheng

arXiv:2510.18863·cs.SE·October 22, 2025

EffiReasonTrans: RL-Optimized Reasoning for Code Translation

Yanlin Wang, Rongyi Ou, Yanli Wang, Mingwei Liu, Jiachi Chen, Ensheng Shi, Xilin Liu, Yuchi Ma, and Zibin Zheng

PDF

Open Access

TL;DR

EffiReasonTrans is a training framework that enhances code translation accuracy using reasoning-augmented data and a two-stage training process, achieving better performance with reduced inference latency.

Contribution

It introduces a novel two-stage training method with reasoning-augmented datasets to improve code translation accuracy while balancing inference latency.

Findings

01

Up to +49.2% CA and +27.8% CodeBLEU improvements.

02

Reduced generated tokens by up to -19.3%.

03

Lowered inference latency in most cases by up to -29.0%.

Abstract

Code translation is a crucial task in software development and maintenance. While recent advancements in large language models (LLMs) have improved automated code translation accuracy, these gains often come at the cost of increased inference latency, hindering real-world development workflows that involve human-in-the-loop inspection. To address this trade-off, we propose EffiReasonTrans, a training framework designed to improve translation accuracy while balancing inference latency. We first construct a high-quality reasoning-augmented dataset by prompting a stronger language model, DeepSeek-R1, to generate intermediate reasoning and target translations. Each (source code, reasoning, target code) triplet undergoes automated syntax and functionality checks to ensure reliability. Based on this dataset, we employ a two-stage training strategy: supervised fine-tuning on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Topic Modeling · Natural Language Processing Techniques