Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning

Jinlong Liu; Mohammed Bahja; Venelin Kovatchev; and Mark Lee

arXiv:2512.05747·cs.CL·April 22, 2026

Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning

Jinlong Liu, Mohammed Bahja, Venelin Kovatchev, and Mark Lee

PDF

1 Repo 5 Models 3 Datasets

TL;DR

This paper introduces a two-stage method using a style-similarity judge and GRPO fine-tuning to improve authorial style in long-form story generation, achieving higher style scores across multiple authors.

Contribution

It presents a novel approach combining a style judge with GRPO to enable controllable style transfer without accept/reject supervision.

Findings

01

The GRPO-trained model scores higher on style than baselines.

02

Style scores averaged 0.893 across four authors.

03

AV-calibrated reward modeling effectively controls style in generation.

Abstract

Evaluating and optimising authorial style in long-form story generation remains challenging because style is often assessed with ad hoc prompting and is frequently conflated with overall writing quality. We propose a two-stage pipeline. First, we train a dedicated style-similarity judge by fine-tuning a sentence-transformer with authorship-verification supervision, and calibrate its similarity outputs into a bounded $[0, 1]$ reward. Second, we use this judge as the primary reward in Group Relative Policy Optimization (GRPO) to fine-tune an 8B story generator for style-conditioned writing, avoiding the accept/reject supervision required by Direct Preference Optimization (DPO). Across four target authors (Mark Twain, Jane Austen, Charles Dickens, Thomas Hardy), the GRPO-trained 8B model achieves higher style scores than open-weight baselines, with an average style score of 0.893 across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vince-liuss/literary_style_model
github

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.