Experiments with Large Language Models on Retrieval-Augmented Generation for Closed-Source Simulation Software
Andreas Baumann, Peter Eberhard

TL;DR
This paper evaluates retrieval-augmented generation systems with local large language models for closed-source simulation software, highlighting improvements and challenges in information retrieval and response quality.
Contribution
It demonstrates the effectiveness of RAG systems with local LLMs for closed-source simulation frameworks and explores methods to enhance response accuracy.
Findings
Local LLMs can produce impressive results with RAG systems.
Tailoring information to prompts significantly improves response quality.
Challenges remain due to insufficient information retrieval from closed-source data.
Abstract
Large Language Models (LLMs) are tools that have become indispensable in development and programming. However, they suffer from hallucinations, especially when dealing with unknown knowledge. This is particularly the case when LLMs are to be used to support closed-source software applications. Retrieval-Augmented Generation (RAG) offers an approach to use additional knowledge alongside the pre-trained knowledge of the LLM to respond to user prompts. Possible tasks range from a smart-autocomplete, text extraction for question answering, model summarization, component explaining, compositional reasoning, to creation of simulation components and complete input models. This work tests existing RAG systems for closed-source simulation frameworks, in our case the mesh-free simulation software Pasimodo. Since data protection and intellectual property rights are particularly important for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
