Cut the CARP: Fishing for zero-shot story evaluation
Shahbuland Matiana, JR Smith, Ryan Teehan, Louis Castricato, Stella, Biderman, Leo Gao, Spencer Frazier

TL;DR
This paper introduces CARP, a contrastive learning-based zero-shot method for evaluating stories that correlates well with human judgment, reducing the need for annotated datasets or finetuning.
Contribution
The paper presents CARP, a novel contrastive learning approach for zero-shot story evaluation, and introduces the Story-Critique Dataset with 1.3 million story-critique pairs.
Findings
CARP shows strong correlation with human evaluations.
Model outputs outperform finetuning-based methods.
The Story-Critique Dataset is a valuable resource for NLP research.
Abstract
Recent advances in large-scale language models (Raffel et al., 2019; Brown et al., 2020) have brought significant qualitative and quantitative improvements in machine-driven text generation. Despite this, generation and evaluation of machine-generated narrative text remains a challenging problem. Objective evaluation of computationally-generated stories may be prohibitively expensive, require meticulously annotated datasets, or may not adequately measure the logical coherence of a generated story's narratological structure. Informed by recent advances in contrastive learning (Radford et al., 2021), we present Contrastive Authoring and Reviewing Pairing (CARP): a scalable, efficient method for performing qualitatively superior, zero-shot evaluation of stories. We show a strong correlation between human evaluation of stories and those of CARP. Model outputs more significantly correlate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Cancer-related gene regulation
MethodsContrastive Learning
