SituatedQA: Incorporating Extra-Linguistic Contexts into QA

Michael J.Q. Zhang; Eunsol Choi

arXiv:2109.06157·cs.CL·September 14, 2021

SituatedQA: Incorporating Extra-Linguistic Contexts into QA

Michael J.Q. Zhang, Eunsol Choi

PDF

1 Repo

TL;DR

SituatedQA introduces a new dataset for question answering that emphasizes the importance of extra-linguistic context like time and location, revealing current models' struggles with context-dependent answers and highlighting the need for more context-aware systems.

Contribution

The paper presents SituatedQA, a novel open-retrieval QA dataset incorporating temporal and geographical contexts, and analyzes the limitations of existing models in handling context-dependent questions.

Findings

01

Models perform poorly on context-dependent questions.

02

Existing models show a 15-point accuracy drop with updated evidence.

03

Many questions in existing datasets are context-sensitive, requiring extra-linguistic information.

Abstract

Answers to the same question may change depending on the extra-linguistic contexts (when and where the question was asked). To study this challenge, we introduce SituatedQA, an open-retrieval QA dataset where systems must produce the correct answer to a question given the temporal or geographical context. To construct SituatedQA, we first identify such questions in existing QA datasets. We find that a significant proportion of information seeking questions have context-dependent answers (e.g., roughly 16.5% of NQ-Open). For such context-dependent questions, we then crowdsource alternative contexts and their corresponding answers. Our study shows that existing models struggle with producing answers that are frequently updated or from uncommon locations. We further quantify how existing models, which are trained on data collected in the past, fail to generalize to answering questions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mikejqzhang/SituatedQA
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.