CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge

Yasumasa Onoe; Michael J.Q. Zhang; Eunsol Choi; Greg Durrett

arXiv:2109.01653·cs.CL·September 6, 2021·26 cites

CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge

Yasumasa Onoe, Michael J.Q. Zhang, Eunsol Choi, Greg Durrett

PDF

Open Access 2 Repos 1 Models

TL;DR

CREAK is a new dataset designed to evaluate models' ability to perform commonsense reasoning about specific entities by combining fact-checking and inference tasks, highlighting challenges in natural language understanding.

Contribution

The paper introduces CREAK, a novel dataset for entity-based commonsense reasoning, and demonstrates its effectiveness in revealing limitations of current models in integrating factual and commonsense knowledge.

Findings

01

Models trained on CREAK outperform baseline fact verification models.

02

Current models still lag behind human performance on CREAK.

03

CREAK effectively probes models' ability to combine fact retrieval with unstated commonsense reasoning.

Abstract

Most benchmark datasets targeting commonsense reasoning focus on everyday scenarios: physical knowledge like knowing that you could fill a cup under a waterfall [Talmor et al., 2019], social knowledge like bumping into someone is awkward [Sap et al., 2019], and other generic situations. However, there is a rich space of commonsense inferences anchored to knowledge about specific entities: for example, deciding the truthfulness of a claim "Harry Potter can teach classes on how to fly on a broomstick." Can models learn to combine entity knowledge with commonsense reasoning in this fashion? We introduce CREAK, a testbed for commonsense reasoning about entity knowledge, bridging fact-checking about entities (Harry Potter is a wizard and is skilled at riding a broomstick) with commonsense inferences (if you're good at a skill you can teach others how to do it). Our dataset consists of 13k…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
fractalego/creak-sense
model· 12 dl· ♡ 1
12 dl♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

MethodsWizard: Unsupervised goats tracking algorithm