Readability Formulas, Systems and LLMs are Poor Predictors of Reading Ease
Keren Gruteke Klein, Shachar Frenkel, Omer Shubi, Yevgeni Berzak

TL;DR
This study evaluates traditional and modern readability scoring methods using real-time eye tracking data, revealing their poor predictive power for actual reading ease across diverse readers and texts, and emphasizing the need for cognitively grounded approaches.
Contribution
Introduces a novel evaluation framework for readability methods based on real-time eye tracking data, demonstrating the inadequacy of existing models and highlighting the potential of psycholinguistic features.
Findings
Traditional and ML-based readability formulas perform poorly in predicting real-time reading ease.
Psycholinguistic word properties often outperform existing readability measures.
Current methods are ineffective across different reader groups and text lengths.
Abstract
Methods for scoring text readability have been studied for over a century, and are widely used in research and in user-facing applications in many domains. Thus far, the development and evaluation of such methods have primarily relied on two types of offline behavioral data, performance on reading comprehension tests and ratings of text readability levels. In this work, we instead focus on a fundamental and understudied aspect of readability, real-time reading ease, captured with online reading measures using eye tracking. We introduce an evaluation framework for readability scoring methods which quantifies their ability to account for reading ease, while controlling for content variation across texts. Applying this evaluation to prominent traditional readability formulas, modern machine learning systems, frontier Large Language Models and commercial systems used in education, suggests…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification
