Measuring and Modifying the Readability of English Texts with GPT-4
Sean Trott (1), Pamela D. Rivi\`ere (1) ((1) Department of, Cognitive Science, University of California San Diego)

TL;DR
This study evaluates GPT-4's ability to assess and modify English text readability, finding it correlates well with human judgments and can influence text difficulty, though with some variability and limitations.
Contribution
It provides empirical evidence that GPT-4 can reliably estimate and alter text readability, outperforming traditional formulas and psycholinguistic indices.
Findings
GPT-4 estimates correlate highly with human judgments
GPT-4 can reliably modify text difficulty
Variability in human perception remains significant
Abstract
The success of Large Language Models (LLMs) in other domains has raised the question of whether LLMs can reliably assess and manipulate the readability of text. We approach this question empirically. First, using a published corpus of 4,724 English text excerpts, we find that readability estimates produced ``zero-shot'' from GPT-4 Turbo and GPT-4o mini exhibit relatively high correlation with human judgments (r = 0.76 and r = 0.74, respectively), out-performing estimates derived from traditional readability formulas and various psycholinguistic indices. Then, in a pre-registered human experiment (N = 59), we ask whether Turbo can reliably make text easier or harder to read. We find evidence to support this hypothesis, though considerable variance in human judgments remains unexplained. We conclude by discussing the limitations of this approach, including limited scope, as well as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification
MethodsLinear Layer · Layer Normalization · Residual Connection · Position-Wise Feed-Forward Layer · Attention Is All You Need · Dense Connections · Softmax · Multi-Head Attention · Adam · Dropout
