A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories
Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh,, Dhruv Batra, Lucy Vanderwende, Pushmeet Kohli, James Allen

TL;DR
This paper introduces the ROCStories corpus and the Story Cloze Test, a new evaluation framework for assessing systems' understanding of commonsense stories, highlighting the difficulty of current models in capturing causal and temporal relations.
Contribution
It provides a large, high-quality corpus of commonsense stories and a novel evaluation framework to measure deeper story understanding in NLP models.
Findings
Baseline models perform poorly on the Story Cloze Test.
The corpus captures rich causal and temporal relations.
Current models struggle with deep story understanding.
Abstract
Representation and learning of commonsense knowledge is one of the foundational problems in the quest to enable deep language understanding. This issue is particularly challenging for understanding casual and correlational relationships between events. While this topic has received a lot of interest in the NLP community, research has been hindered by the lack of a proper evaluation framework. This paper attempts to address this problem with a new framework for evaluating story understanding and script learning: the 'Story Cloze Test'. This test requires a system to choose the correct ending to a four-sentence story. We created a new corpus of ~50k five-sentence commonsense stories, ROCStories, to enable this evaluation. This corpus is unique in two ways: (1) it captures a rich set of causal and temporal commonsense relations between daily events, and (2) it is a high quality collection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
