Enriching Ontology-based Data Access with Provenance (Extended Version)
Diego Calvanese, Davide Lanti, Ana Ozaki, Rafael Penaloza, Guohui Xiao

TL;DR
This paper enhances ontology-based data access by integrating provenance semirings, enabling explanation of query results and addressing complexity issues, with practical implementation and evaluation.
Contribution
It introduces provenance annotations into OBDA, studies entailment and polynomial computation for provenance, and demonstrates practical feasibility with a system implementation.
Findings
Provenance annotations can be effectively integrated into OBDA.
The approach is feasible for DL-Lite ontologies with finite provenance polynomials.
Experimental evaluation shows practical applicability on benchmark datasets.
Abstract
Ontology-based data access (OBDA) is a popular paradigm for querying heterogeneous data sources by connecting them through mappings to an ontology. In OBDA, it is often difficult to reconstruct why a tuple occurs in the answer of a query. We address this challenge by enriching OBDA with provenance semirings, taking inspiration from database theory. In particular, we investigate the problems of (i) deciding whether a provenance annotated OBDA instance entails a provenance annotated conjunctive query, and (ii) computing a polynomial representing the provenance of a query entailed by a provenance annotated OBDA instance. Differently from pure databases, in our case these polynomials may be infinite. To regain finiteness, we consider idempotent semirings, and study the complexity in the case of DL-Lite ontologies. We implement Task (ii) in a state-of-the-art OBDA system and show the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Semantic Web and Ontologies · Data Quality and Management
