Do Massively Pretrained Language Models Make Better Storytellers?

Abigail See; Aneesh Pappu; Rohun Saxena; Akhila Yerukola; Christopher D. Manning

arXiv:1909.10705 · cs.CL · September 25, 2019

TL;DR

This paper evaluates whether a massively pretrained language model, GPT2-117, is a better storyteller than a specialized neural story generation model (Fan et al., 2018). It finds that GPT2-117 is more sensitive to context, yet it still suffers from repetition and a lack of diversity.

Contribution

The paper provides a detailed comparison of a pretrained model and a specialized story generation model across a wide variety of automatic metrics, highlighting the strengths and limitations of the pretrained model.

Findings

01 GPT2-117 conditions more strongly on context.

02 GPT2-117 is more sensitive to event order and uses more unusual words.

03 GPT2-117 still produces repetitive text that lacks diversity.
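
The repetition and diversity findings can be probed with simple n-gram statistics. The following is a minimal sketch, not the paper's evaluation code: `distinct_n` and `repeated_ngram_fraction` are hypothetical helper names, and whitespace tokenization is an assumption made for illustration. The metrics actually used in the paper may be defined differently.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def distinct_n(tokens, n):
    """Fraction of n-grams that are unique; higher suggests more diverse text.

    Hypothetical helper, assumed for illustration only.
    """
    grams = ngrams(tokens, n)
    if not grams:
        return 0.0
    return len(set(grams)) / len(grams)


def repeated_ngram_fraction(tokens, n):
    """Fraction of n-gram occurrences whose n-gram appears more than once.

    Hypothetical helper; higher suggests more repetitive text.
    """
    grams = ngrams(tokens, n)
    if not grams:
        return 0.0
    counts = Counter(grams)
    repeated = sum(c for c in counts.values() if c > 1)
    return repeated / len(grams)


if __name__ == "__main__":
    # Toy example with whitespace tokenization (an assumption).
    story = "the cat sat on the mat the cat sat".split()
    print(distinct_n(story, 2))
    print(repeated_ngram_fraction(story, 2))
```

Comparing these scores for stories generated by two models from the same prompts gives a rough view of which model repeats itself more, though the scores depend heavily on the decoding settings and text length.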

Abstract

Large neural language models trained on massive amounts of text have emerged as a formidable strategy for Natural Language Understanding tasks. However, the strength of these models as Natural Language Generators is less clear. Though anecdotal evidence suggests that these models generate better quality text, there has been no detailed study characterizing their generation abilities. In this work, we compare the performance of an extensively pretrained model, OpenAI GPT2-117 (Radford et al., 2019), to a state-of-the-art neural story generation model (Fan et al., 2018). By evaluating the generated text across a wide variety of automatic metrics, we characterize the ways in which pretrained models do, and do not, make better storytellers. We find that although GPT2-117 conditions more strongly on context, is more sensitive to ordering of events, and uses more unusual words, it is just as…

Code & Models

Repositories

abisee/story-generation-eval (PyTorch, official)
