LegacyTranslate: LLM-based Multi-Agent Method for Legacy Code Translation
Zahra Moti, Heydar Soudani, Jonck van der Kogel

TL;DR
LegacyTranslate is a multi-agent framework utilizing specialized LLMs to improve the accuracy and integration of legacy code translation, specifically from PL/SQL to Java, in enterprise modernization projects.
Contribution
It introduces a novel multi-agent approach with API-aware agents that enhance code translation quality and correctness in real-world legacy system modernization.
Findings
Initial translation achieves 45.6% compilable code
API grounding improves compilation by 8%
Refinement increases test-pass rate by 3%
Abstract
Modernizing large legacy systems remains a major challenge in enterprise environments, particularly when migration must preserve domain-specific logic while conforming to internal architectural frameworks and shared APIs. Direct application of Large Language Models (LLMs) for code translation often produces syntactically valid outputs that fail to compile or integrate within existing production frameworks, limiting their practical adoption in real-world modernization efforts. In this paper, we propose LegacyTranslate, a multi-agent framework for API-aware code translation, developed and evaluated in the context of an ongoing modernization effort at a financial institution migrating approximately 2.5 million lines of PL/SQL to Java. The core idea is to use specialized LLM-based agents, each addressing a different aspect of the translation challenge. Specifically, LegacyTranslate consists…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Natural Language Processing Techniques · Model-Driven Software Engineering Techniques
