Q${}^2$Forge: Minting Competency Questions and SPARQL Queries for Question-Answering Over Knowledge Graphs

Yousouf Taghzouti (WIMMICS; ICN); Franck Michel (Laboratoire I3S - SPARKS; WIMMICS); Tao Jiang (ICN); Louis-F\'elix Nothias (ICN); Fabien Gandon (WIMMICS; Laboratoire I3S - SPARKS)

arXiv:2505.13572·cs.DB·December 15, 2025

Q${}^2$Forge: Minting Competency Questions and SPARQL Queries for Question-Answering Over Knowledge Graphs

Yousouf Taghzouti (WIMMICS, ICN), Franck Michel (Laboratoire I3S - SPARKS, WIMMICS), Tao Jiang (ICN), Louis-F\'elix Nothias (ICN), Fabien Gandon (WIMMICS, Laboratoire I3S - SPARKS)

PDF

TL;DR

Q^2Forge is a modular framework that automates the creation and validation of competency questions and SPARQL queries for knowledge graphs, aiding non-experts and improving dataset quality for training language models.

Contribution

It introduces a generic, extensible pipeline for generating, validating, and refining competency questions and SPARQL queries, enhancing knowledge graph documentation and query dataset creation.

Findings

01

Supports creation of reference query sets for any KG

02

Iteratively validates queries with human feedback and LLMs

03

Open source and modular design facilitates customization

Abstract

The SPARQL query language is the standard method to access knowledge graphs (KGs). However, formulating SPARQL queries is a significant challenge for non-expert users, and remains time-consuming for the experienced ones. Best practices recommend to document KGs with competency questions and example queries to contextualise the knowledge they contain and illustrate their potential applications. In practice, however, this is either not the case or the examples are provided in limited numbers. Large Language Models (LLMs) are being used in conversational agents and are proving to be an attractive solution with a wide range of applications, from simple question-answering about common knowledge to generating code in a targeted programming language. However, training and testing these models to produce high quality SPARQL queries from natural language questions requires substantial datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.