Competency Questions and SPARQL-OWL Queries Dataset and Analysis
Dawid Wisniewski, Jedrzej Potoniec, Agnieszka Lawrynowicz, C. Maria, Keet

TL;DR
This paper presents a dataset of 234 competency questions and their SPARQL-OWL formalizations across various ontologies, analyzing linguistic patterns and query signatures to improve ontology testing and development practices.
Contribution
It introduces a publicly available dataset of CQs and SPARQL-OWL queries, along with an analysis of linguistic patterns and query signatures to enhance ontology engineering methodologies.
Findings
Identified 106 distinct CQ patterns with limited overlap across ontologies.
Discovered that one CQ pattern can correspond to multiple SPARQL-OWL query signatures.
Provided a dataset to support automation and standardization in CQ formulation and formalization.
Abstract
Competency Questions (CQs) are natural language questions outlining and constraining the scope of knowledge represented by an ontology. Despite that CQs are a part of several ontology engineering methodologies, we have observed that the actual publication of CQs for the available ontologies is very limited and even scarcer is the publication of their respective formalisations in terms of, e.g., SPARQL queries. This paper aims to contribute to addressing the engineering shortcomings of using CQs in ontology development, to facilitate wider use of CQs. In order to understand the relation between CQs and the queries over the ontology to test the CQs on an ontology, we gather, analyse, and publicly release a set of 234 CQs and their translations to SPARQL-OWL for several ontologies in different domains developed by different groups. We analysed the CQs in two principal ways. The first stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
