Benchmarking triple stores with biological data
Vladimir Mironov, Nirmala Seethappan, Ward Blonde, Erick Antezana,, Bjorn Lindi, Martin Kuiper

TL;DR
This paper benchmarks five non-commercial triple stores using biological data, analyzing query performance, scalability, and reproducibility, revealing distinct strengths of each store across different query types.
Contribution
It provides a comparative analysis of triple store performance on biological data, highlighting query-specific behaviors and identifying the most balanced and efficient options.
Findings
Virtuoso showed balanced performance across all queries.
OWLIM excelled with short response time queries.
4Store performed best with moderate response time queries.
Abstract
We have compared the performance of five non-commercial triple stores, Virtuoso-open source, Jena SDB, Jena TDB, SWIFT-OWLIM and 4Store. We examined three performance aspects: the query execution time, scalability and run-to-run reproducibility. The queries we chose addressed different ontological or biological topics, and we obtained evidence that individual store performance was quite query specific. We identified three groups of queries displaying similar behavior across the different stores: 1) relatively short response time, 2) moderate response time and 3) relatively long response time. OWLIM proved to be a winner in the first group, 4Store in the second and Virtuoso in the third. Our benchmarking showed Virtuoso to be a very balanced performer - its response time was better than average for all the 24 queries; it showed a very good scalability and a reasonable run-to-run…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
