Evaluating the effectiveness of LLM-based interoperability

Rodrigo Falc\~ao; Stefan Schweitzer; Julien Siebert; Emily Calvet; Frank Elberzhager

arXiv:2510.23893·cs.SE·October 29, 2025

Evaluating the effectiveness of LLM-based interoperability

Rodrigo Falc\~ao, Stefan Schweitzer, Julien Siebert, Emily Calvet, Frank Elberzhager

PDF

TL;DR

This paper evaluates the ability of large language models to enable autonomous, runtime interoperability between heterogeneous systems, focusing on agricultural data exchange and comparing multiple models and strategies.

Contribution

It introduces an empirical evaluation of 13 open source LLMs for autonomous system interoperability, highlighting the effectiveness of specific models and strategies in a real-world use case.

Findings

01

qwen2.5-coder:32b achieved highest effectiveness with both strategies

02

Strategy CODEGEN outperformed DIRECT in complex dataset scenarios

03

Some LLMs can enable autonomous system interoperability

Abstract

Background: Systems of systems are becoming increasingly dynamic and heterogeneous, and this adds pressure on the long-standing challenge of interoperability. Besides its technical aspect, interoperability has also an economic side, as development time efforts are required to build the interoperability artifacts. Objectives: With the recent advances in the field of large language models (LLMs), we aim at analyzing the effectiveness of LLM-based strategies to make systems interoperate autonomously, at runtime, without human intervention. Method: We selected 13 open source LLMs and curated four versions of a dataset in the agricultural interoperability use case. We performed three runs of each model with each version of the dataset, using two different strategies. Then we compared the effectiveness of the models and the consistency of their results across multiple runs. Results:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.