Evaluating the effectiveness of LLM-based interoperability
Rodrigo Falc\~ao, Stefan Schweitzer, Julien Siebert, Emily Calvet, Frank Elberzhager

TL;DR
This paper evaluates the ability of large language models to enable autonomous, runtime interoperability between heterogeneous systems, focusing on agricultural data exchange and comparing multiple models and strategies.
Contribution
It introduces an empirical evaluation of 13 open source LLMs for autonomous system interoperability, highlighting the effectiveness of specific models and strategies in a real-world use case.
Findings
qwen2.5-coder:32b achieved highest effectiveness with both strategies
Strategy CODEGEN outperformed DIRECT in complex dataset scenarios
Some LLMs can enable autonomous system interoperability
Abstract
Background: Systems of systems are becoming increasingly dynamic and heterogeneous, and this adds pressure on the long-standing challenge of interoperability. Besides its technical aspect, interoperability has also an economic side, as development time efforts are required to build the interoperability artifacts. Objectives: With the recent advances in the field of large language models (LLMs), we aim at analyzing the effectiveness of LLM-based strategies to make systems interoperate autonomously, at runtime, without human intervention. Method: We selected 13 open source LLMs and curated four versions of a dataset in the agricultural interoperability use case. We performed three runs of each model with each version of the dataset, using two different strategies. Then we compared the effectiveness of the models and the consistency of their results across multiple runs. Results:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
