Dragoman: Efficiently Evaluating Declarative Mapping Languages over Frameworks for Knowledge Graph Creation
Samaneh Jozashoori, Enrique Iglesias, Maria-Esther Vidal

TL;DR
Dragoman introduces an optimization framework that transforms function-based RDF mapping rules into function-free equivalents, significantly reducing execution time and improving scalability in knowledge graph creation pipelines.
Contribution
The paper formalizes a transformation and optimization process for function-based mappings, enabling efficient evaluation by converting them into function-free mappings.
Findings
Eager evaluation of function-based mappings reduces execution time by up to 75%.
Transformations maintain correctness and equivalence to original mappings.
Framework Dragoman demonstrates scalability in large, complex data integration scenarios.
Abstract
In recent years, there have been valuable efforts and contributions to make the process of RDF knowledge graph creation traceable and transparent; extending and applying declarative mapping languages is an example. One challenging step is the traceability of procedures that aim to overcome interoperability issues, a.k.a. data-level integration. In most pipelines, data integration is performed by ad-hoc programs, preventing traceability and reusability. However, formal frameworks provided by function-based declarative mapping languages such as FunUL and RML+FnO empower expressiveness. Data-level integration can be defined as functions and integrated as part of the mappings performing schema-level integration. However, combining functions with the mappings introduces a new source of complexity that can considerably impact the required number of resources and execution time. We tackle the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Database Systems and Queries · Scientific Computing and Data Management
