ROSIE: Runtime Optimization of SPARQL Queries Using Incremental Evaluation
Lei Gai, Wei Chen, Tengjiao Wang

TL;DR
ROSIE is a runtime optimization framework for SPARQL queries that iteratively refines query plans based on actual cardinality, significantly improving performance on complex queries over large-scale RDF data.
Contribution
It introduces a novel runtime re-optimization approach for SPARQL queries using incremental evaluation and heuristic plan generation, addressing cardinality estimation errors.
Findings
Outperforms state-of-the-art methods on complex queries
Achieves orders of magnitude speedup on large datasets
Effectively detects and corrects cardinality estimation errors
Abstract
Relational databases are wildly adopted in RDF (Resource Description Framework) data management. For efficient SPARQL query evaluation, the legacy query optimizer needs reconsiderations. One vital problem is how to tackle the suboptimal query plan caused by error-prone cardinality estimation. Consider the schema-free nature of RDF data and the Join-intensive characteristic of SPARQL query, determine an optimal execution order before the query actually evaluated is costly or even infeasible, especially for complex queries on large-scale data. In this paper, we propose ROSIE, a Runtime Optimization framework that iteratively re-optimize SPARQL query plan according to the actual cardinality derived from Incremental partial query Evaluation. By introducing an approach for heuristic-based plan generation, as well as a mechanism to detect cardinality estimation error at runtime, ROSIE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Database Systems and Queries · Data Management and Algorithms
