Echoes in AI: Quantifying lack of plot diversity in LLM outputs
Weijia Xu, Nebojsa Jojic, Sudha Rao, Chris Brockett, Bill Dolan

TL;DR
This paper investigates the lack of plot diversity in LLM-generated stories, introduces the Sui Generis score to quantify plot uniqueness, and finds that LLMs tend to echo similar plot elements, reducing creative diversity.
Contribution
The paper introduces the Sui Generis score as a novel automatic metric to measure plot element uniqueness in LLM outputs, highlighting issues of echoing and lack of diversity.
Findings
LLMs often echo plot elements across generations.
Sui Generis score correlates with human surprise judgments.
Human stories rarely have echoed plot elements.
Abstract
With rapid advances in large language models (LLMs), there has been an increasing application of LLMs in creative content ideation and generation. A critical question emerges: can current LLMs provide ideas that are diverse enough to truly bolster collective creativity? We examine two state-of-the-art LLMs, GPT-4 and LLaMA-3, on story generation and discover that LLM-generated stories often consist of plot elements that are echoed across a number of generations. To quantify this phenomenon, we introduce the Sui Generis score, an automatic metric that measures the uniqueness of a plot element among alternative storylines generated using the same prompt under an LLM. Evaluating on 100 short stories, we find that LLM-generated stories often contain combinations of idiosyncratic plot elements echoed frequently across generations and across different LLMs, while plots from the original…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsAttention Is All You Need · Byte Pair Encoding · Dense Connections · Absolute Position Encodings · Dropout · Linear Layer · Softmax · Adam · Residual Connection · Multi-Head Attention
