Towards Human-Level Book-Writing Capability

Jan Zierstek; Matteo Batelic; Maya Medjad; Tim Sch\"onenberger

arXiv:2605.17064·cs.AI·May 19, 2026

Towards Human-Level Book-Writing Capability

Jan Zierstek, Matteo Batelic, Maya Medjad, Tim Sch\"onenberger

PDF

TL;DR

This paper introduces a novel dataset and training framework for long-form creative writing with language models, aiming to produce more human-like literary texts by hierarchical planning and fine-tuning.

Contribution

It presents a prompt-to-book generation approach using hierarchical summaries to improve the quality and human-likeness of AI-generated fiction.

Findings

01

Model trained on prompt-to-book trajectories produces more stylistically diverse stories.

02

Hierarchical planning improves the structural coherence of generated books.

03

Training on human-authored fiction enhances literary quality of outputs.

Abstract

Large language models optimized for instruction following and agentic tasks remain poorly aligned with the requirements of high-quality creative writing. Fiction frequently depends on behaviors that assistant-tuned models are explicitly trained to avoid, particularly deception, moral ambiguity, and unreliable narration. As a result, generated stories often appear structurally correct while remaining stylistically generic, overly explanatory, or weakly grounded in human literary behavior. We present a dataset construction and training framework for book-scale creative writing that reframes supervised fine-tuning as a prompt-to-book generation task grounded in human-authored fiction. Starting from public-domain novels, we derive a multi-resolution planning scaffold by summarizing each book at progressively finer levels, from a high-level premise to chapter- and scene-level structure. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.