CREPE: Open-Domain Question Answering with False Presuppositions

Xinyan Velocity Yu; Sewon Min; Luke Zettlemoyer; Hannaneh; Hajishirzi

arXiv:2211.17257·cs.CL·December 1, 2022·1 cites

CREPE: Open-Domain Question Answering with False Presuppositions

Xinyan Velocity Yu, Sewon Min, Luke Zettlemoyer, Hannaneh, Hajishirzi

PDF

Open Access 1 Repo

TL;DR

CREPE introduces a new open-domain QA dataset with real-world questions containing false presuppositions, highlighting challenges in factual verification and evidence retrieval for improved question answering systems.

Contribution

The paper presents CREPE, a novel dataset capturing presupposition failures in natural questions, and analyzes baseline model performance on this realistic, challenging QA task.

Findings

01

25% of questions contain false presuppositions

02

Existing models struggle to verify presupposition correctness

03

Evidence retrieval remains a key challenge

Abstract

Information seeking users often pose questions with false presuppositions, especially when asking about unfamiliar topics. Most existing question answering (QA) datasets, in contrast, assume all questions have well defined answers. We introduce CREPE, a QA dataset containing a natural distribution of presupposition failures from online information-seeking forums. We find that 25% of questions contain false presuppositions, and provide annotations for these presuppositions and their corrections. Through extensive baseline experiments, we show that adaptations of existing open-domain QA models can find presuppositions moderately well, but struggle when predicting whether a presupposition is factually correct. This is in large part due to difficulty in retrieving relevant evidence passages from a large text corpus. CREPE provides a benchmark to study question answering in the wild, and our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

velocitycavalry/crepe
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Advanced Graph Neural Networks