Software Model Evolution with Large Language Models: Experiments on Simulated, Public, and Industrial Datasets
Christof Tinnes, Alisa Welter, Sven Apel

TL;DR
This paper investigates the use of large language models for software model evolution, proposing RAMC and demonstrating promising results across industrial, public, and simulated datasets.
Contribution
It introduces RAMC, a novel approach leveraging large language models and retrieval-augmented generation for software model completion, validated on diverse datasets.
Findings
62.30% semantically correct completions on industrial data
up to 86.19% type-correct completions
large language models excel with scarce or noisy data
Abstract
Modeling structure and behavior of software systems plays a crucial role in the industrial practice of software engineering. As with other software engineering artifacts, software models are subject to evolution. Supporting modelers in evolving software models with recommendations for model completions is still an open problem, though. In this paper, we explore the potential of large language models for this task. In particular, we propose an approach, RAMC, leveraging large language models, model histories, and retrieval-augmented generation for model completion. Through experiments on three datasets, including an industrial application, one public open-source community dataset, and one controlled collection of simulated model repositories, we evaluate the potential of large language models for model completion with RAMC. We found that large language models are indeed a promising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Business Process Modeling and Analysis
