Leveraging LLMs for Automated Translation of Legacy Code: A Case Study on PL/SQL to Java Transformation

Lola Solovyeva; Eduardo Carneiro Oliveira; Shiyu Fan; Alper Tuncay; Shamil Gareev; Andrea Capiluppi

arXiv:2508.19663 · cs.SE · August 28, 2025

TL;DR

This paper explores using large language models to automate the translation of a large legacy PL/SQL system into Java, demonstrating promising results despite limited data and validation resources.

Contribution

The paper proposes a customized prompting strategy that combines chain-of-guidance reasoning with $n$-shot prompting for LLM-based code translation from PL/SQL to Java.
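
As a rough illustration of how guidance steps and $n$-shot examples might be combined in a single prompt, consider the minimal sketch below. It is not the authors' prompt: the wording of `GUIDANCE_STEPS`, the `Example` structure, and the `build_prompt` helper are all hypothetical, and the sketch assumes that chain-of-guidance means giving the model an ordered list of steps to follow before it sees the worked examples.

```python
from dataclasses import dataclass


@dataclass
class Example:
    """One worked PL/SQL-to-Java pair used as a shot (hypothetical structure)."""
    plsql: str
    java: str


# Hypothetical guidance steps; the paper's actual wording is not given in this summary.
GUIDANCE_STEPS = [
    "Identify the inputs, outputs, and side effects of the PL/SQL unit.",
    "Map PL/SQL types and tables to the provided Java domain classes.",
    "Translate the control flow into idiomatic Java.",
    "Check that the Java code is consistent with the domain classes.",
]


def build_prompt(source: str, examples: list[Example],
                 domain_classes: list[str], n: int) -> str:
    """Assemble numbered guidance steps, domain classes, and the first n examples."""
    if n < 0:
        raise ValueError("n must be non-negative")
    parts = ["Translate the PL/SQL code below into Java.", "",
             "Follow these steps in order:"]
    parts += [f"{i}. {step}" for i, step in enumerate(GUIDANCE_STEPS, start=1)]
    parts += ["", "Java domain classes available:"]
    parts += domain_classes
    # Only the first n pairs are included as shots.
    for i, ex in enumerate(examples[:n], start=1):
        parts += ["", f"Example {i} PL/SQL:", ex.plsql,
                  f"Example {i} Java:", ex.java]
    # The prompt ends with the unit to translate and an open "Java:" slot.
    parts += ["", "PL/SQL to translate:", source, "Java:"]
    return "\n".join(parts)
```

Calling `build_prompt(source, examples, domain_classes, n=3)` would yield a three-shot prompt; setting `n=0` keeps only the guidance steps and domain classes, which makes it easy to compare the effect of the examples in isolation.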

Findings

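01 LLMs can generate syntactically correct PL/SQL-to-Java translations.

02 The customized prompting strategy guides LLMs toward syntactically accurate translations that also achieve functional correctness.

03 The findings are limited by the small dataset (10 PL/SQL-to-Java code pairs and 15 Java classes) and the limited validation scope.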

Abstract

The VT legacy system, comprising approximately 2.5 million lines of PL/SQL code, lacks consistent documentation and automated tests, posing significant challenges for refactoring and modernisation. This study investigates the feasibility of leveraging large language models (LLMs) to assist in translating PL/SQL code into Java for the modernised "VTF3" system. By leveraging a dataset comprising 10 PL/SQL-to-Java code pairs and 15 Java classes, which collectively established a domain model for the translated files, multiple LLMs were evaluated. Furthermore, we propose a customized prompting strategy that integrates chain-of-guidance reasoning with $n$-shot prompting. Our findings indicate that this methodology effectively guides LLMs in generating syntactically accurate translations while also achieving functional correctness. However, the findings are limited by the small sample size of…