Would You Ask it that Way? Measuring and Improving Question Naturalness for Knowledge Graph Question Answering
Trond Linjordet, Krisztian Balog

TL;DR
This paper introduces the IQN-KGQA test collection to evaluate and improve the naturalness of questions in knowledge graph question answering, revealing that more realistic questions can challenge existing systems.
Contribution
The study develops a new dataset with rewritten, more natural questions and analyzes their impact on KGQA system performance, highlighting challenges in dataset construction.
Findings
Some KGQA systems perform worse with more natural questions
The IQN-KGQA dataset enables realistic evaluation of KGQA models
Rewritten questions improve fluency but may affect system accuracy
Abstract
Knowledge graph question answering (KGQA) facilitates information access by leveraging structured data without requiring formal query language expertise from the user. Instead, users can express their information needs by simply asking their questions in natural language (NL). Datasets used to train KGQA models that would provide such a service are expensive to construct, both in terms of expert and crowdsourced labor. Typically, crowdsourced labor is used to improve template-based pseudo-natural questions generated from formal queries. However, the resulting datasets often fall short of representing genuinely natural and fluent language. In the present work, we investigate ways to characterize and remedy these shortcomings. We create the IQN-KGQA test collection by sampling questions from existing KGQA datasets and evaluating them with regards to five different aspects of naturalness.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
