Abductive Commonsense Reasoning

Chandra Bhagavatula; Ronan Le Bras; Chaitanya Malaviya; Keisuke; Sakaguchi; Ari Holtzman; Hannah Rashkin; Doug Downey; Scott Wen-tau Yih and; Yejin Choi

arXiv:1908.05739·cs.CL·February 17, 2020·34 cites

Abductive Commonsense Reasoning

Chandra Bhagavatula, Ronan Le Bras, Chaitanya Malaviya, Keisuke, Sakaguchi, Ari Holtzman, Hannah Rashkin, Doug Downey, Scott Wen-tau Yih and, Yejin Choi

PDF

Open Access 2 Repos 5 Datasets

TL;DR

This paper introduces a new dataset and tasks for evaluating abductive reasoning in natural language, revealing current models' limitations compared to human reasoning capabilities.

Contribution

It presents the first dataset and tasks for language-based abductive reasoning, highlighting the gap between model performance and human reasoning.

Findings

01

Models achieve 68.9% accuracy on abductive NLI, below 91.4% human performance.

02

Current language generators struggle with abductive explanations due to reasoning limitations.

03

Analysis uncovers specific reasoning types where models fail, guiding future research.

Abstract

Abductive reasoning is inference to the most plausible explanation. For example, if Jenny finds her house in a mess when she returns from work, and remembers that she left a window open, she can hypothesize that a thief broke into her house and caused the mess, as the most plausible explanation. While abduction has long been considered to be at the core of how people interpret and read between the lines in natural language (Hobbs et al., 1988), there has been relatively little research in support of abductive natural language inference and generation. We present the first study that investigates the viability of language-based abductive reasoning. We introduce a challenge dataset, ART, that consists of over 20k commonsense narrative contexts and 200k explanations. Based on this dataset, we conceptualize two new tasks -- (i) Abductive NLI: a multiple-choice question answering task for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications