CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
Alon Talmor, Jonathan Herzig, Nicholas Lourie, Jonathan Berant

TL;DR
CommonsenseQA introduces a challenging dataset for evaluating AI's ability to answer questions requiring commonsense knowledge, highlighting the gap between current models and human performance.
Contribution
The paper presents a new dataset with 12,247 questions designed to test commonsense reasoning, created via crowd-sourcing from ConceptNet relations.
Findings
BERT-large achieves 56% accuracy on the dataset
Human performance is 89% accuracy
The dataset reveals significant challenges for current models
Abstract
When answering a question, people often draw upon their rich world knowledge in addition to the particular context. Recent work has focused primarily on answering questions given some relevant document or context, and required very little general background. To investigate question answering with prior knowledge, we present CommonsenseQA: a challenging new dataset for commonsense question answering. To capture common sense beyond associations, we extract from ConceptNet (Speer et al., 2017) multiple target concepts that have the same semantic relation to a single source concept. Crowd-workers are asked to author multiple-choice questions that mention the source concept and discriminate in turn between each of the target concepts. This encourages workers to create questions with complex semantics that often require prior knowledge. We create 12,247 questions through this procedure and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google/gemma-7bmodel· 30k dl· ♡ 329330k dl♡ 3293
- 🤗google/gemma-2-2b-itmodel· 368k dl· ♡ 1314368k dl♡ 1314
- 🤗google/gemma-2-2bmodel· 489k dl· ♡ 636489k dl♡ 636
- 🤗google/gemma-2bmodel· 174k dl· ♡ 1152174k dl♡ 1152
- 🤗google/gemma-2-27b-itmodel· 309k dl· ♡ 561309k dl♡ 561
- 🤗google/gemma-2-9b-itmodel· 254k dl· ♡ 781254k dl♡ 781
- 🤗ataeff/recurrentgemma-2b-itmodel· ♡ 1♡ 1
- 🤗google/gemma-2b-itmodel· 57k dl· ♡ 86257k dl♡ 862
- 🤗google/gemma-7b-itmodel· 67k dl· ♡ 124167k dl♡ 1241
- 🤗alpindale/gemma-7bmodel· 66 dl· ♡ 766 dl♡ 7
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies
