CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge
Willis Guo, Armin Toroghi, Scott Sanner

TL;DR
This paper introduces CR-LT-KGQA, a novel dataset for knowledge graph question answering that emphasizes commonsense reasoning and long-tail entities, addressing limitations of existing datasets and challenging current LLMs.
Contribution
The creation of CR-LT-KGQA, a dataset supporting commonsense reasoning and focusing on long-tail entities, extending existing datasets and highlighting challenges for LLMs.
Findings
LLMs hallucinate frequently on CR-LT-KGQA
Baseline models struggle with commonsense inference
CR-LT-KGQA enables future research on factual long-tail entity answering
Abstract
Knowledge graph question answering (KGQA) is a well-established field that seeks to provide factual answers to natural language (NL) questions by leveraging knowledge graphs (KGs). However, existing KGQA datasets suffer from two significant limitations: (1) no existing KGQA dataset requires commonsense reasoning to arrive at an answer and (2) existing KGQA datasets focus on popular entities for which large language models (LLMs) can directly answer without hallucinating and without leveraging the KG. In this work, we seek a novel KGQA dataset that supports commonsense reasoning and focuses on long-tail entities (e.g., non-mainstream and recent entities) where LLMs frequently hallucinate, and thus create the need for novel methodologies that leverage the KG for factual and attributable commonsense inference. We create a novel Commonsense Reasoning (CR) and Long-Tail (LT) KGQA dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Topic Modeling · Advanced Graph Neural Networks
MethodsFocus
