TL;DR
AlphaTrans is a neuro-symbolic system that automates repository-level code translation and validation, effectively handling complex real-world projects and ensuring functional correctness through multiple validation steps.
Contribution
It introduces a novel neuro-symbolic approach that decomposes programs for translation, enabling scalable and accurate repository-level code translation and validation.
Findings
96.40% syntactic correctness in translated fragments
25.14% of fragments validated for functional correctness
20.1 hours average time for developers to fix translation bugs
Abstract
Code translation transforms programs from one programming language (PL) to another. Several rule-based transpilers have been designed to automate code translation between different pairs of PLs. However, the rules can become obsolete as the PLs evolve and cannot generalize to other PLs. Recent studies have explored the automation of code translation using Large Language Models (LLMs). One key observation is that such techniques may work well for crafted benchmarks but fail to generalize to the scale and complexity of real-world projects with dependencies, custom types, PL-specific features, etc. We propose AlphaTrans, a neuro-symbolic approach to automate repository-level code translation. AlphaTrans translates both source and test code, and employs multiple levels of validation to ensure the translation preserves the functionality of the source program. To break down the problem for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
