A Benchmark for Gap and Overlap Analysis as a Test of KG Task Readiness
Maruf Ahmed Mridul, Rohit Kapa, Oshani Seneviratne

TL;DR
This paper introduces a benchmark for evaluating knowledge graph quality through gap and overlap analysis in policy documents, emphasizing explainability, reproducibility, and evidence-based validation.
Contribution
It provides an executable, auditable benchmark aligning natural language contracts with ontologies and evidence, enabling systematic comparison of different KG analysis methods.
Findings
Ontology-driven approach improves consistency in gap/overlap analysis.
Explicit modeling enhances diagnosis and reliability over text-only methods.
Benchmark supports evaluation of KG quality and downstream tasks.
Abstract
Task-oriented evaluation of knowledge graph (KG) quality increasingly asks whether an ontology-based representation can answer the competency questions that users actually care about, in a manner that is reproducible, explainable, and traceable to evidence. This paper adopts that perspective and focuses on gap and overlap analysis for policy-like documents (e.g., insurance contracts), where given a scenario, which documents support it (overlap) and which do not (gap), with defensible justifications. The resulting gap/overlap determinations are typically driven by genuine differences in coverage and restrictions rather than missing data, making the task a direct test of KG task readiness rather than a test of missing facts or query expressiveness. We present an executable and auditable benchmark that aligns natural-language contract text with a formal ontology and evidence-linked ground…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
