CoQA: A Conversational Question Answering Challenge
Siva Reddy, Danqi Chen, Christopher D. Manning

TL;DR
CoQA is a new dataset designed to advance conversational question answering systems by providing a large, diverse set of human-annotated conversational questions and answers across multiple domains, highlighting unique challenges.
Contribution
The paper introduces CoQA, a large-scale, multi-domain dataset for conversational QA, and provides baseline evaluations demonstrating its complexity and the gap with human performance.
Findings
Strong models achieve 65.4% F1 score
Conversational questions involve coreference and pragmatic reasoning
Significant room for improvement over human performance
Abstract
Humans gather information by engaging in conversations involving a series of interconnected questions and answers. For machines to assist in information gathering, it is therefore essential to enable them to answer conversational questions. We introduce CoQA, a novel dataset for building Conversational Question Answering systems. Our dataset contains 127k questions with answers, obtained from 8k conversations about text passages from seven diverse domains. The questions are conversational, and the answers are free-form text with their corresponding evidence highlighted in the passage. We analyze CoQA in depth and show that conversational questions have challenging phenomena not present in existing reading comprehension datasets, e.g., coreference and pragmatic reasoning. We evaluate strong conversational and reading comprehension models on CoQA. The best system obtains an F1 score of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
