Measuring and Modifying the Readability of English Texts with GPT-4

Sean Trott (1); Pamela D. Rivi\`ere (1) ((1) Department of; Cognitive Science; University of California San Diego)

arXiv:2410.14028·cs.CL·October 21, 2024

Measuring and Modifying the Readability of English Texts with GPT-4

Sean Trott (1), Pamela D. Rivi\`ere (1) ((1) Department of, Cognitive Science, University of California San Diego)

PDF

Open Access 1 Repo

TL;DR

This study evaluates GPT-4's ability to assess and modify English text readability, finding it correlates well with human judgments and can influence text difficulty, though with some variability and limitations.

Contribution

It provides empirical evidence that GPT-4 can reliably estimate and alter text readability, outperforming traditional formulas and psycholinguistic indices.

Findings

01

GPT-4 estimates correlate highly with human judgments

02

GPT-4 can reliably modify text difficulty

03

Variability in human perception remains significant

Abstract

The success of Large Language Models (LLMs) in other domains has raised the question of whether LLMs can reliably assess and manipulate the readability of text. We approach this question empirically. First, using a published corpus of 4,724 English text excerpts, we find that readability estimates produced ``zero-shot'' from GPT-4 Turbo and GPT-4o mini exhibit relatively high correlation with human judgments (r = 0.76 and r = 0.74, respectively), out-performing estimates derived from traditional readability formulas and various psycholinguistic indices. Then, in a pre-registered human experiment (N = 59), we ask whether Turbo can reliably make text easier or harder to read. We find evidence to support this hypothesis, though considerable variance in human judgments remains unexplained. We conclude by discussing the limitations of this approach, including limited scope, as well as the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

seantrott/llm_readability
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification

MethodsLinear Layer · Layer Normalization · Residual Connection · Position-Wise Feed-Forward Layer · Attention Is All You Need · Dense Connections · Softmax · Multi-Head Attention · Adam · Dropout