WebNovelBench: Placing LLM Novelists on the Web Novel Distribution

Leon Lin; Jun Zheng; Haidong Wang

arXiv:2505.14818·cs.CL·May 22, 2025

WebNovelBench: Placing LLM Novelists on the Web Novel Distribution

Leon Lin, Jun Zheng, Haidong Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

WebNovelBench is a new benchmark that evaluates the storytelling quality of large language models on long-form Chinese web novels using a multi-dimensional, automated, and data-driven approach.

Contribution

It introduces a large-scale dataset, a novel multi-faceted evaluation framework, and a comprehensive analysis of state-of-the-art LLMs for long-form novel generation.

Findings

01

WebNovelBench effectively differentiates human and LLM-generated stories.

02

Ranking of 24 LLMs reveals varying storytelling capabilities.

03

Automated evaluation correlates well with human judgments.

Abstract

Robustly evaluating the long-form storytelling capabilities of Large Language Models (LLMs) remains a significant challenge, as existing benchmarks often lack the necessary scale, diversity, or objective measures. To address this, we introduce WebNovelBench, a novel benchmark specifically designed for evaluating long-form novel generation. WebNovelBench leverages a large-scale dataset of over 4,000 Chinese web novels, framing evaluation as a synopsis-to-story generation task. We propose a multi-faceted framework encompassing eight narrative quality dimensions, assessed automatically via an LLM-as-Judge approach. Scores are aggregated using Principal Component Analysis and mapped to a percentile rank against human-authored works. Our experiments demonstrate that WebNovelBench effectively differentiates between human-written masterpieces, popular web novels, and LLM-generated content. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

OedonLestrange42/webnovelbench
pytorch

Videos

WebNovelBench: Placing LLM Novelists on the Web Novel Distribution· underline

Taxonomy

TopicsWikis in Education and Collaboration · Mathematics, Computing, and Information Processing