Towards Enabling FAIR Dataspaces Using Large Language Models

Benedikt T. Arnold; Johannes Theissen-Lipp; Diego Collarana; Christoph; Lange; Sandra Geisler; Edward Curry; Stefan Decker

arXiv:2403.15451·cs.CL·March 26, 2024·1 cites

Towards Enabling FAIR Dataspaces Using Large Language Models

Benedikt T. Arnold, Johannes Theissen-Lipp, Diego Collarana, Christoph, Lange, Sandra Geisler, Edward Curry, Stefan Decker

PDF

Open Access

TL;DR

This paper explores how Large Language Models can facilitate the adoption of FAIR dataspaces, addressing complexity challenges and proposing a research agenda for future exploration.

Contribution

It demonstrates the potential of LLMs in supporting FAIR dataspaces and outlines a research agenda for this emerging field.

Findings

01

LLMs can support FAIR dataspaces effectively

02

A concrete example illustrating LLM application in dataspaces

03

Proposes a research agenda for future work

Abstract

Dataspaces have recently gained adoption across various sectors, including traditionally less digitized domains such as culture. Leveraging Semantic Web technologies helps to make dataspaces FAIR, but their complexity poses a significant challenge to the adoption of dataspaces and increases their cost. The advent of Large Language Models (LLMs) raises the question of how these models can support the adoption of FAIR dataspaces. In this work, we demonstrate the potential of LLMs in dataspaces with a concrete example. We also derive a research agenda for exploring this emerging field.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsResearch Data Management Practices · Scientific Computing and Data Management · Data Quality and Management