A Benchmark for Gap and Overlap Analysis as a Test of KG Task Readiness

Maruf Ahmed Mridul; Rohit Kapa; Oshani Seneviratne

arXiv:2604.10853·cs.AI·April 14, 2026

A Benchmark for Gap and Overlap Analysis as a Test of KG Task Readiness

Maruf Ahmed Mridul, Rohit Kapa, Oshani Seneviratne

PDF

TL;DR

This paper introduces a benchmark for evaluating knowledge graph quality through gap and overlap analysis in policy documents, emphasizing explainability, reproducibility, and evidence-based validation.

Contribution

It provides an executable, auditable benchmark aligning natural language contracts with ontologies and evidence, enabling systematic comparison of different KG analysis methods.

Findings

01

Ontology-driven approach improves consistency in gap/overlap analysis.

02

Explicit modeling enhances diagnosis and reliability over text-only methods.

03

Benchmark supports evaluation of KG quality and downstream tasks.

Abstract

Task-oriented evaluation of knowledge graph (KG) quality increasingly asks whether an ontology-based representation can answer the competency questions that users actually care about, in a manner that is reproducible, explainable, and traceable to evidence. This paper adopts that perspective and focuses on gap and overlap analysis for policy-like documents (e.g., insurance contracts), where given a scenario, which documents support it (overlap) and which do not (gap), with defensible justifications. The resulting gap/overlap determinations are typically driven by genuine differences in coverage and restrictions rather than missing data, making the task a direct test of KG task readiness rather than a test of missing facts or query expressiveness. We present an executable and auditable benchmark that aligns natural-language contract text with a formal ontology and evidence-linked ground…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.