The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing
Guillermo Marco, Julio Gonzalo, V\'ictor Fresno

TL;DR
This study investigates how different reader profiles influence evaluations of AI and human-created texts, revealing that evaluation discrepancies are largely due to reader preferences rather than text quality.
Contribution
The paper introduces a reader-sensitive evaluation framework by modeling individual reader preferences and analyzing their impact on literary quality assessments.
Findings
Readers cluster into surface-focused and holistic profiles.
Reader preferences significantly influence quality evaluations.
Textual feature importance varies across reader profiles.
Abstract
Recent studies comparing AI-generated and human-authored literary texts have produced conflicting results: some suggest AI already surpasses human quality, while others argue it still falls short. We start from the hypothesis that such divergences can be largely explained by genuine differences in how readers interpret and value literature, rather than by an intrinsic quality of the texts evaluated. Using five public datasets (1,471 stories, 101 annotators including critics, students, and lay readers), we (i) extract 17 reference-less textual features (e.g., coherence, emotional variance, average sentence length...); (ii) model individual reader preferences, deriving feature importance vectors that reflect their textual priorities; and (iii) analyze these vectors in a shared "preference space". Reader vectors cluster into two profiles: 'surface-focused readers' (mainly non-experts), who…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Authorship Attribution and Profiling · Media Influence and Health
MethodsALIGN
