EffiReasonTrans: RL-Optimized Reasoning for Code Translation
Yanlin Wang, Rongyi Ou, Yanli Wang, Mingwei Liu, Jiachi Chen, Ensheng Shi, Xilin Liu, Yuchi Ma, and Zibin Zheng

TL;DR
EffiReasonTrans is a training framework that enhances code translation accuracy using reasoning-augmented data and a two-stage training process, achieving better performance with reduced inference latency.
Contribution
It introduces a novel two-stage training method with reasoning-augmented datasets to improve code translation accuracy while balancing inference latency.
Findings
Up to +49.2% CA and +27.8% CodeBLEU improvements.
Reduced generated tokens by up to -19.3%.
Lowered inference latency in most cases by up to -29.0%.
Abstract
Code translation is a crucial task in software development and maintenance. While recent advancements in large language models (LLMs) have improved automated code translation accuracy, these gains often come at the cost of increased inference latency, hindering real-world development workflows that involve human-in-the-loop inspection. To address this trade-off, we propose EffiReasonTrans, a training framework designed to improve translation accuracy while balancing inference latency. We first construct a high-quality reasoning-augmented dataset by prompting a stronger language model, DeepSeek-R1, to generate intermediate reasoning and target translations. Each (source code, reasoning, target code) triplet undergoes automated syntax and functionality checks to ensure reliability. Based on this dataset, we employ a two-stage training strategy: supervised fine-tuning on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Natural Language Processing Techniques
