CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge
Yasumasa Onoe, Michael J.Q. Zhang, Eunsol Choi, Greg Durrett

TL;DR
CREAK is a new dataset designed to evaluate models' ability to perform commonsense reasoning about specific entities by combining fact-checking and inference tasks, highlighting challenges in natural language understanding.
Contribution
The paper introduces CREAK, a novel dataset for entity-based commonsense reasoning, and demonstrates its effectiveness in revealing limitations of current models in integrating factual and commonsense knowledge.
Findings
Models trained on CREAK outperform baseline fact verification models.
Current models still lag behind human performance on CREAK.
CREAK effectively probes models' ability to combine fact retrieval with unstated commonsense reasoning.
Abstract
Most benchmark datasets targeting commonsense reasoning focus on everyday scenarios: physical knowledge like knowing that you could fill a cup under a waterfall [Talmor et al., 2019], social knowledge like bumping into someone is awkward [Sap et al., 2019], and other generic situations. However, there is a rich space of commonsense inferences anchored to knowledge about specific entities: for example, deciding the truthfulness of a claim "Harry Potter can teach classes on how to fly on a broomstick." Can models learn to combine entity knowledge with commonsense reasoning in this fashion? We introduce CREAK, a testbed for commonsense reasoning about entity knowledge, bridging fact-checking about entities (Harry Potter is a wizard and is skilled at riding a broomstick) with commonsense inferences (if you're good at a skill you can teach others how to do it). Our dataset consists of 13k…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
MethodsWizard: Unsupervised goats tracking algorithm
