$\text{EFO}_{k}$-CQA: Towards Knowledge Graph Complex Query Answering beyond Set Operation
Hang Yin, Zihao Wang, Weizhi Fei, Yangqiu Song

TL;DR
This paper introduces a new dataset and framework for complex knowledge graph query answering that extends existing set operation-based methods, enabling better evaluation of reasoning over incomplete knowledge.
Contribution
It proposes a comprehensive framework for data generation, model training, and evaluation of $ ext{EFO}_{k}$ queries, and constructs a new dataset with diverse query types to improve benchmarking.
Findings
The dataset includes 741 query types for thorough evaluation.
Existing datasets are systematically biased, affecting method development.
Query hardness impacts the performance of reasoning methods.
Abstract
To answer complex queries on knowledge graphs, logical reasoning over incomplete knowledge is required due to the open-world assumption. Learning-based methods are essential because they are capable of generalizing over unobserved knowledge. Therefore, an appropriate dataset is fundamental to both obtaining and evaluating such methods under this paradigm. In this paper, we propose a comprehensive framework for data generation, model training, and method evaluation that covers the combinatorial space of Existential First-order Queries with multiple variables (). The combinatorial query space in our framework significantly extends those defined by set operations in the existing literature. Additionally, we construct a dataset, -CQA, with 741 types of query for empirical evaluation, and our benchmark results provide new insights into how query hardness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Bayesian Modeling and Causal Inference · Data Quality and Management
