OBDA for the Web: Creating Virtual RDF Graphs On Top of Web Data Sources
Konstantina Bereta, George Papadakis, Manolis Koubarakis

TL;DR
This paper introduces Ontop4theWeb, a system that creates virtual RDF graphs from diverse web data sources like HTML tables and APIs, enabling real-time, on-the-fly querying without data materialization.
Contribution
It presents a novel framework for mapping heterogeneous web data into virtual RDF graphs, supporting efficient SPARQL querying across various web data formats.
Findings
Supports querying of HTML tables and Web APIs using SPARQL
Demonstrates high efficiency and performance in experiments
Outperforms current state-of-the-art in functionality and speed
Abstract
Due to Variety, Web data come in many different structures and formats, with HTML tables and REST APIs (e.g., social media APIs) being among the most popular ones. A big subset of Web data is also characterised by Velocity, as data gets frequently updated so that consumers can obtain the most up-to-date version of the respective datasets. At the moment, though, these data sources are not effectively supported by Semantic Web tools. To address variety and velocity, we propose Ontop4theWeb, a system that maps Web data of various formats into virtual RDF triples, thus allowing for querying them on-the-fly without materializing them as RDF. We demonstrate how Ontop4theWeb can use SPARQL to uniformly query popular, but heterogeneous Web data sources, like HTML tables and Web APIs. We showcase our approach in a number of use cases, such as Twitter, Foursquare, Yelp and HTML tables. We carried…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Web Data Mining and Analysis · Data Quality and Management
