Query-Specific Pruning of RML Mappings (Extended Version)
Sitt Min Oo, Olaf Hartig

TL;DR
This paper introduces a query-specific pruning method for RML mappings that enhances efficiency in dynamic RDF graph materialization and querying by reducing unnecessary data processing.
Contribution
It formally defines query satisfiability for SPARQL with RML data and uses this to prune mappings, improving performance in dynamic query environments.
Findings
Pruning reduces materialization time significantly.
Graph size decreases with pruning.
Query response times improve notably.
Abstract
Current approaches for knowledge graph construction with RML focus on full RDF graph materialization without considering user queries. As a result, mapping engines are inefficient in dynamic query environments, materializing large graphs even when only a small subset is needed to answer user queries. In this paper, we formally define satisfiability for SPARQL queries with respect to RDF data obtained via RML mappings and use this property to prune RML mappings for partial RDF graph materialization. Evaluation on the GTFS-Madrid benchmark shows that pruning significantly reduces materialization time, and RDF graph size while also noticeably improving querying time. Thus, enabling existing materialization engines to efficiently support generating RDF graphs in dynamic federated querying environment where user queries change frequently.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
