KadiAssistant: A conversational AI Agent for information retrieval in Kadi4Mat
Adrian Cierpka, Mohammad Shafiqul Islam, Johannes Steinh\"ulb, Eric Dietriche Sesso Domtchoueng, Michael Selzer, Arnd Koeppe

TL;DR
KadiAssistant is a privacy-aware AI tool integrated into the Kadi ecosystem, enabling efficient, secure information retrieval from heterogeneous research data to support interdisciplinary scientific discovery.
Contribution
It introduces a novel combination of a self-hosted large language model with privacy-preserving semantic search for complex, access-controlled research data retrieval.
Findings
Enables structured, comprehensive answers from sensitive research data
Respects fine-grained access permissions in data retrieval
Bridges terminology gaps across disciplines
Abstract
We introduce KadiAssistant, a privacy-by-design AI assistant integrated into the Kadi research data ecosystem, enabling researchers to efficiently access, aggregate, and synthesize information from heterogeneous, privacy-sensitive research data. Interdisciplinary fields such as materials science bring together disciplines with their own terminology and standards. While this convergence fuels innovation, it also makes it increasingly difficult to connect and access knowledge, as data are distributed across disciplines, organizations, and individuals. For example, battery research combines electrochemical measurements, materials characterization data, physics-based simulations, and manufacturing parameters, each using different formats, vocabularies, and standards. Efficiently storing and sharing such heterogeneous data via research data platforms, such as Kadi4Mat, demands domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
