Would You Ask it that Way? Measuring and Improving Question Naturalness   for Knowledge Graph Question Answering

Trond Linjordet; Krisztian Balog

arXiv:2205.12768·cs.IR·May 26, 2022

Would You Ask it that Way? Measuring and Improving Question Naturalness for Knowledge Graph Question Answering

Trond Linjordet, Krisztian Balog

PDF

2 Repos 1 Datasets

TL;DR

This paper introduces the IQN-KGQA test collection to evaluate and improve the naturalness of questions in knowledge graph question answering, revealing that more realistic questions can challenge existing systems.

Contribution

The study develops a new dataset with rewritten, more natural questions and analyzes their impact on KGQA system performance, highlighting challenges in dataset construction.

Findings

01

Some KGQA systems perform worse with more natural questions

02

The IQN-KGQA dataset enables realistic evaluation of KGQA models

03

Rewritten questions improve fluency but may affect system accuracy

Abstract

Knowledge graph question answering (KGQA) facilitates information access by leveraging structured data without requiring formal query language expertise from the user. Instead, users can express their information needs by simply asking their questions in natural language (NL). Datasets used to train KGQA models that would provide such a service are expensive to construct, both in terms of expert and crowdsourced labor. Typically, crowdsourced labor is used to improve template-based pseudo-natural questions generated from formal queries. However, the resulting datasets often fall short of representing genuinely natural and fluent language. In the present work, we investigate ways to characterize and remedy these shortcomings. We create the IQN-KGQA test collection by sampling questions from existing KGQA datasets and evaluating them with regards to five different aspects of naturalness.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

atoroghi/CR-LT-KGQA
dataset· 29 dl
29 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methodstravel james