Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing

Akash Dhruv; Anshu Dubey

arXiv:2410.24119·cs.SE·November 10, 2025

Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing

Akash Dhruv, Anshu Dubey

PDF

Open Access 1 Repo

TL;DR

This paper explores how large language models and GenAI can assist in code translation and development for scientific computing, introducing a tool called CodeScribe that improves legacy code modernization.

Contribution

We developed CodeScribe, a novel tool combining prompt engineering and supervision to facilitate efficient code translation and integration in scientific computing workflows.

Findings

01

CodeScribe effectively translates Fortran to C++

02

It generates APIs for legacy-modern system integration

03

AI-assisted translation enhances productivity in scientific computing

Abstract

The emergence of foundational models and generative artificial intelligence (GenAI) is poised to transform productivity in scientific computing, especially in code development, refactoring, and translating from one programming language to another. However, because the output of GenAI cannot be guaranteed to be correct, manual intervention remains necessary. Some of this intervention can be automated through task-specific tools, alongside additional methodologies for correctness verification and effective prompt development. We explored the application of GenAI in assisting with code translation, language interoperability, and codebase inspection within a legacy Fortran codebase used to simulate particle interactions at the Large Hadron Collider (LHC). In the process, we developed a tool, CodeScribe, which combines prompt engineering with user supervision to establish an efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neucol/llm-conversion-performance
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management · Computational Physics and Python Applications