BLINKG: A Benchmark for LLM-Integrated Knowledge Graph Generation
Carla Castedo, Enrique Iglesias, Manuel Lama, Alberto Bugarin-Diz, Maria-Esther Vidal, David Chaves-Fraga

TL;DR
This paper introduces BLINKG, a benchmark to evaluate how effectively large language models can assist in constructing knowledge graphs from diverse data sources, highlighting current capabilities and limitations.
Contribution
The paper presents BLINKG, a standardized benchmark for assessing LLMs in knowledge graph generation, and provides an extensive evaluation of state-of-the-art models.
Findings
LLMs show promising results in KG construction tasks.
Performance decreases in complex scenarios.
BLINKG helps identify current limitations of LLMs in this domain.
Abstract
Generating Knowledge Graphs (KGs) remains one of the most time-consuming and labor-intensive tasks for knowledge engineers, as they need to identify semantic equivalences between input data sources and ontology terms. While declarative solutions (e.g., RML, SPARQL-Anything) have helped to generalize this process, aligning input schema elements with ontology terms still involves intricate transformations and requires considerable manual effort. With the advent of Large Language Models (LLMs), there is growing interest in leveraging their capabilities to assist KG engineers. Although some studies have explored using LLMs to automate KG construction, there is still no standardized framework for assessing how effectively they establish correspondences between data schemes and ontology concepts. Therefore, in this paper, we propose BLINKG, a benchmark designed to evaluate the mapping…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
