Retell, Reward, Repeat: Reinforcement Learning for Narrative Theory-Informed Story Generation

David Y. Liu; Xanthe Muston; Aditya Joshi; Sebastian Sequoiah-Grayson

arXiv:2601.17226·cs.CL·January 27, 2026

Open Access

TL;DR

This paper explores reinforcement learning, guided by narrative principles, as a post-training method for automatic story generation. The resulting stories are more diverse and better aligned with human narrative conventions than those produced by supervised fine-tuning.

Contribution

The paper introduces a reinforcement learning approach (d-RLAIF) whose rewards come from narrative principles, and shows that it improves diversity and alignment in story generation over supervised fine-tuning.

Findings

01. d-RLAIF produces more diverse stories than supervised fine-tuning.

02. Its stories are better aligned with human narrative conventions.

03. Reinforcement learning is a viable alternative to supervised fine-tuning.

Abstract

Despite the subjective nature of storytelling, past work on automatic story generation (ASG) has relied on limited ground truths for training and evaluation. In this work, we explore reinforcement learning (d-RLAIF) as a post-training alternative to supervised fine-tuning (SFT). We first apply Todorov's Theory of Narrative Equilibrium to establish principles that define desirable ASG qualities. We prompt 7B and 14B LLM-as-judge models with our principles to test their alignment with human annotators and to provide reward signals during d-RLAIF. We use Gemini-3-Flash to evaluate the output of our post-trained models and compare them to human-written stories from the TimeTravel dataset. We show that d-RLAIF offers a viable alternative to SFT, producing stories that are more diverse and aligned with human narrative conventions. Our paper demonstrates the promise of…
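As a rough illustration of how a principle-prompted LLM-as-judge could supply a reward signal, the sketch below builds a judge prompt from a list of principles and maps the judge's reply to a scalar reward. Everything in it is an assumption made for illustration: the principle wording (a paraphrase of the five stages usually attributed to Todorov), the 1-to-5 scale, the `Score:` reply format, and the names `PRINCIPLES`, `build_judge_prompt`, `parse_score`, `reward`, and `stub_judge` are hypothetical and are not taken from the paper. The stub stands in for a real 7B or 14B judge model.

```python
import re
from typing import Callable, List

# Hypothetical principles paraphrasing Todorov's five narrative stages.
# The paper's actual principle wording is not given in this summary.
PRINCIPLES: List[str] = [
    "The story opens in a stable state (equilibrium).",
    "An event disrupts that state.",
    "The characters recognise the disruption.",
    "There is an attempt to repair the disruption.",
    "The story ends in a new equilibrium.",
]


def build_judge_prompt(story: str, principles: List[str] = PRINCIPLES) -> str:
    """Format the principles and the story into a single judge prompt."""
    numbered = "\n".join(f"{i}. {p}" for i, p in enumerate(principles, 1))
    return (
        "Rate how well the story satisfies the principles below.\n"
        f"{numbered}\n\nStory:\n{story}\n\n"
        "Answer with 'Score: <integer from 1 to 5>'."
    )


def parse_score(reply: str, lo: int = 1, hi: int = 5) -> float:
    """Map the judge's reply to a reward in [0, 1]; unparseable replies get 0."""
    match = re.search(r"Score:\s*(\d+)", reply)
    if match is None:
        return 0.0
    score = min(max(int(match.group(1)), lo), hi)  # clamp to the assumed scale
    return (score - lo) / (hi - lo)


def reward(story: str, judge: Callable[[str], str]) -> float:
    """Scalar reward for one generated story, as judged by `judge`."""
    return parse_score(judge(build_judge_prompt(story)))


def stub_judge(prompt: str) -> str:
    """Stand-in for a real judge model; always returns the same score."""
    return "Score: 4"


if __name__ == "__main__":
    print(reward("A quiet village floods, and the villagers rebuild.", stub_judge))
```

In a real training loop the `judge` callable would wrap a model call, and the returned reward would feed whichever policy-optimisation step is used; neither detail is specified in the text above.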

Taxonomy

Topics: Artificial Intelligence in Games · Topic Modeling · Multimodal Machine Learning Applications