TL;DR
This paper presents RMLStreamer-SISO, a scalable RDF stream generator from heterogeneous data streams that achieves low latency and high throughput, enabling efficient integration with stream reasoning systems.
Contribution
It extends RMLStreamer to generate RDF streams from dynamic heterogeneous data streams using a dynamic window approach, improving scalability and performance.
Findings
Achieves millisecond latency compared to seconds of previous solutions.
Maintains constant memory usage across workloads.
Sustains throughput of around 70,000 records/sec.
Abstract
Stream-reasoning query languages such as CQELS and C-SPARQL enable query answering over RDF streams. Unfortunately, there currently is a lack of efficient RDF stream generators to feed RDF stream reasoners. State-of-the-art RDF stream generators are limited with regard to the velocity and volume of streaming data they can handle. To efficiently generate RDF streams in a scalable way, we extended the RMLStreamer to also generate RDF streams from dynamic heterogeneous data streams. This paper introduces a scalable solution that relies on a dynamic window approach to generate RDF streams with low latency and high throughput from multiple heterogeneous data streams. Our evaluation shows that our solution outperforms the state-of-the-art by achieving millisecond latency (compared to seconds that state-of-the-art solutions need), constant memory usage for all workloads, and sustainable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
