LASSI: An LLM-based Automated Self-Correcting Pipeline for Translating   Parallel Scientific Codes

Matthew T. Dearing; Yiheng Tao; Xingfu Wu; Zhiling Lan; Valerie Taylor

arXiv:2407.01638·cs.SE·May 6, 2025·1 cites

LASSI: An LLM-based Automated Self-Correcting Pipeline for Translating Parallel Scientific Codes

Matthew T. Dearing, Yiheng Tao, Xingfu Wu, Zhiling Lan, Valerie Taylor

PDF

Open Access

TL;DR

LASSI is an automated pipeline that leverages LLMs with self-correcting loops to translate parallel scientific codes between programming languages, significantly improving translation accuracy and runtime performance.

Contribution

The paper introduces LASSI, a novel self-correcting framework for translating parallel scientific codes using LLMs, enabling scalable and accurate bi-directional code translation.

Findings

01

80% of OpenMP to CUDA translations produce expected output

02

85% of CUDA to OpenMP translations produce expected output

03

78% of translations run within 10% of original runtime or faster

Abstract

This paper addresses the problem of providing a novel approach to sourcing significant training data for LLMs focused on science and engineering. In particular, a crucial challenge is sourcing parallel scientific codes in the ranges of millions to billions of codes. To tackle this problem, we propose an automated pipeline framework called LASSI, designed to translate between parallel programming languages by bootstrapping existing closed- or open-source LLMs. LASSI incorporates autonomous enhancement through self-correcting loops where errors encountered during the compilation and execution of generated code are fed back to the LLM through guided prompting for debugging and refactoring. We highlight the bi-directional translation of existing GPU benchmarks between OpenMP target offload and CUDA to validate LASSI. The results of evaluating LASSI with different application codes across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed and Parallel Computing Systems