Heuristics-based Query Reordering for Federated Queries in SPARQL 1.1 and SPARQL-LD
Thanos Yannakis, Pavlos Fafalios, Yannis Tzitzikas

TL;DR
This paper introduces heuristic-based query reordering techniques for federated SPARQL 1.1 and SPARQL-LD queries, significantly reducing execution times by optimizing remote resource calls without needing source statistics.
Contribution
It proposes a novel set of heuristics for query reordering that improve federated query performance in SPARQL 1.1 and SPARQL-LD without relying on remote source statistics.
Findings
Query reordering reduces execution time significantly.
The method achieves optimal plans in 88% of cases.
Applicable to existing SPARQL implementations.
Abstract
The federated query extension of SPARQL 1.1 allows executing queries distributed over different SPARQL endpoints. SPARQL-LD is a recent extension of SPARQL 1.1 which enables to directly query any HTTP web source containing RDF data, like web pages embedded with RDFa, JSON-LD or Microformats, without requiring the declaration of named graphs. This makes possible to query a large number of data sources (including SPARQL endpoints, online resources, or even Web APIs returning RDF data) through a single one concise query. However, not optimal formulation of SPARQL 1.1 and SPARQL-LD queries can lead to a large number of calls to remote resources which in turn can lead to extremely high query execution times. In this paper, we address this problem and propose a set of query reordering methods which make use of heuristics to reorder a set of SERVICE graph patterns based on their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Database Systems and Queries · Data Management and Algorithms
