Bench4KE: Benchmarking Automated Competency Question Generation

Anna Sofia Lippolis; Minh Davide Ragagni; Paolo Ciancarini; Andrea Giovanni Nuzzolese; Valentina Presutti

arXiv:2505.24554·cs.CL·December 10, 2025

Bench4KE: Benchmarking Automated Competency Question Generation

Anna Sofia Lippolis, Minh Davide Ragagni, Paolo Ciancarini, Andrea Giovanni Nuzzolese, Valentina Presutti

PDF

1 Repo

TL;DR

Bench4KE introduces a standardized benchmarking system for evaluating automated Competency Question generation tools using real-world datasets and similarity metrics, enabling fair comparison and fostering progress in KE automation research.

Contribution

It provides an extensible, standardized benchmarking framework with curated datasets and evaluation metrics for CQ generation tools, addressing the lack of evaluation consistency in KE automation.

Findings

01

Six CQ generation systems evaluated, establishing a baseline.

02

Benchmarking system facilitates fair comparison of KE automation tools.

03

Code and datasets are publicly available for reproducibility.

Abstract

The availability of Large Language Models (LLMs) presents a unique opportunity to reinvigorate research on Knowledge Engineering (KE) automation. This trend is already evident in recent efforts developing LLM-based methods and tools for the automatic generation of Competency Questions (CQs), natural language questions used by ontology engineers to define the functional requirements of an ontology. However, the evaluation of these tools lacks standardization. This undermines the methodological rigor and hinders the replication and comparison of results. To address this gap, we introduce Bench4KE, an extensible API-based benchmarking system for KE automation. The presented release focuses on evaluating tools that generate CQs automatically. Bench4KE provides a curated gold standard consisting of CQ datasets from 17 real-world ontology engineering projects and uses a suite of similarity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fossr-project/ontogenia-cini
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsOntology