SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, Percy Liang

TL;DR
The paper introduces SQuAD, a large-scale reading comprehension dataset with over 100,000 questions from Wikipedia, enabling the development and evaluation of machine reading models.
Contribution
It provides a new, extensive dataset for machine comprehension, along with analysis of question types and baseline model performance.
Findings
Baseline model achieves 51.0% F1 score.
Human performance is 86.8%, highlighting the dataset's challenge.
Analysis reveals diverse reasoning types needed for answering questions.
Abstract
We present the Stanford Question Answering Dataset (SQuAD), a new reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage. We analyze the dataset to understand the types of reasoning required to answer the questions, leaning heavily on dependency and constituency trees. We build a strong logistic regression model, which achieves an F1 score of 51.0%, a significant improvement over a simple baseline (20%). However, human performance (86.8%) is much higher, indicating that the dataset presents a good challenge problem for future research. The dataset is freely available at https://stanford-qa.com
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google-t5/t5-smallmodel· 1.9M dl· ♡ 5381.9M dl♡ 538
- 🤗google-t5/t5-largemodel· 451k dl· ♡ 253451k dl♡ 253
- 🤗google-t5/t5-11bmodel· 22k dl· ♡ 6922k dl♡ 69
- 🤗google-t5/t5-3bmodel· 428k dl· ♡ 52428k dl♡ 52
- 🤗google-t5/t5-basemodel· 1.8M dl· ♡ 7701.8M dl♡ 770
- 🤗p208p2002/bart-squad-nqg-hlmodel· 2 dl2 dl
- 🤗p208p2002/bart-squad-qg-hlmodel· 10 dl· ♡ 410 dl♡ 4
- 🤗p208p2002/gpt2-squad-nqg-hlmodel· 16 dl16 dl
- 🤗p208p2002/gpt2-squad-qg-hlmodel· 8 dl· ♡ 38 dl♡ 3
- 🤗p208p2002/t5-squad-nqg-hlmodel· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
