Opportunities for Shape-based Optimization of Link Traversal Queries
Bryan-Elliott Tam, Ruben Taelman, Pieter Colpaert, Ruben Verborgh

TL;DR
This paper explores using RDF data shapes to optimize link traversal queries over decentralized web data. With a new source selection algorithm, it reports execution time reduced by up to 80% and 97% fewer links traversed.
Contribution
The authors developed an initial source selection algorithm leveraging RDF data shape mappings to improve the performance of link traversal query processing.
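To make the idea of shape-based source selection concrete, here is a minimal sketch. It is not the authors' algorithm. It assumes a hypothetical index in which each entry maps a URL pattern to the set of predicates allowed by a closed shape. The names `ShapeMapping`, `should_dereference`, `select_sources`, and the `pod.example` URLs are all invented for illustration. A link is skipped only when the index shows that its document cannot contain any predicate used by the query.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase


@dataclass(frozen=True)
class ShapeMapping:
    """Hypothetical index entry: documents whose URL matches `url_pattern`
    are assumed to conform to a closed shape allowing only `predicates`."""
    url_pattern: str
    predicates: frozenset


def should_dereference(link, query_predicates, shape_index):
    """Return False only if a shape mapping rules the document out."""
    for mapping in shape_index:
        if fnmatchcase(link, mapping.url_pattern):
            # The document is described by a shape: follow the link only
            # if the shape shares at least one predicate with the query.
            return bool(mapping.predicates & query_predicates)
    # No shape information for this link: follow it to stay conservative.
    return True


def select_sources(links, query_predicates, shape_index):
    """Keep the links that may contribute to the query, in input order."""
    return [l for l in links if should_dereference(l, query_predicates, shape_index)]


# Illustrative data (assumed, not taken from the paper).
shape_index = [
    ShapeMapping("https://pod.example/posts/*", frozenset({"ex:content", "ex:creator"})),
    ShapeMapping("https://pod.example/profile/*", frozenset({"foaf:name"})),
]
links = [
    "https://pod.example/posts/1",
    "https://pod.example/profile/card",
    "https://pod.example/unknown/x",
]
selected = select_sources(links, frozenset({"ex:content"}), shape_index)
```

In this sketch the profile document is pruned because its shape has no predicate in common with the query. The unmapped link is still followed, since nothing is known about its content.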
Findings
Reduced execution time by up to 80%
Decreased links traversed by 97%
Potential for non-heuristic query planning using RDF shapes
Abstract
Data on the web is naturally unindexed and decentralized. Centralizing web data, especially personal data, raises ethical and legal concerns. Yet, compared to centralized query approaches, decentralization-friendly alternatives such as Link Traversal Query Processing (LTQP) are significantly less performant and less well understood. The two main difficulties of LTQP are the lack of a priori information about data sources and the high number of HTTP requests. Exploring decentralization-friendly ways to document unindexed networks of data sources could lead to solutions that alleviate those difficulties. RDF data shapes are widely used to validate linked data documents; it is therefore worthwhile to investigate their potential for LTQP optimization. In our work, we built an early version of a source selection algorithm for LTQP using RDF data shape mappings with linked data documents and measured its…
Taxonomy
Topics: Manufacturing Process and Optimization · Advanced Numerical Analysis Techniques
