AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and Validation

Ali Reza Ibrahimzada; Kaiyao Ke; Mrigank Pawagi; Muhammad Salman Abid; Rangeet Pan; Saurabh Sinha; Reyhaneh Jabbarvand

arXiv:2410.24117·cs.SE·June 23, 2025

AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and Validation

Ali Reza Ibrahimzada, Kaiyao Ke, Mrigank Pawagi, Muhammad Salman Abid, Rangeet Pan, Saurabh Sinha, Reyhaneh Jabbarvand

PDF

1 Repo

TL;DR

AlphaTrans is a neuro-symbolic system that automates repository-level code translation and validation, effectively handling complex real-world projects and ensuring functional correctness through multiple validation steps.

Contribution

It introduces a novel neuro-symbolic approach that decomposes programs for translation, enabling scalable and accurate repository-level code translation and validation.

Findings

01

96.40% syntactic correctness in translated fragments

02

25.14% of fragments validated for functional correctness

03

20.1 hours average time for developers to fix translation bugs

Abstract

Code translation transforms programs from one programming language (PL) to another. Several rule-based transpilers have been designed to automate code translation between different pairs of PLs. However, the rules can become obsolete as the PLs evolve and cannot generalize to other PLs. Recent studies have explored the automation of code translation using Large Language Models (LLMs). One key observation is that such techniques may work well for crafted benchmarks but fail to generalize to the scale and complexity of real-world projects with dependencies, custom types, PL-specific features, etc. We propose AlphaTrans, a neuro-symbolic approach to automate repository-level code translation. AlphaTrans translates both source and test code, and employs multiple levels of validation to ensure the translation preserves the functionality of the source program. To break down the problem for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Intelligent-CAT-Lab/AlphaTrans
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.