# Computational Sentence‐Level Metrics of Reading Speed and Its Ramifications for Sentence Comprehension

**Authors:** Kun Sun, Rong Wang

PMC · DOI: 10.1111/cogs.70092 · Cognitive Science · 2025-07-22

## TL;DR

This paper introduces two computational sentence-level metrics, sentence surprisal and sentence relevance, and reports that they predict sentence reading speed and comprehension difficulty across languages.

## Contribution

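The paper introduces two novel sentence-level metrics, sentence surprisal and sentence relevance, for modeling reading speed and comprehension difficulty. Sentence surprisal is computed with multilingual LLMs in three ways (chain rule, next sentence prediction, and negative log-likelihood); sentence relevance is computed with a "memory-aware" approach based on convolution operations.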

## Key findings

- Sentence-level metrics accurately predict human reading speed across languages.
- The metrics capture comprehension difficulty beyond what word-level measures can explain.
- The methods generalize well and offer strong interpretability for cognitive modeling.

## Abstract

The majority of research in computational psycholinguistics on sentence processing has focused on word‐by‐word incremental processing within sentences, rather than holistic sentence‐level representations. This study introduces two novel computational approaches for quantifying sentence‐level processing: sentence surprisal and sentence relevance. Using multilingual large language models (LLMs), we compute sentence surprisal through three methods (chain rule, next sentence prediction, and negative log‐likelihood) and apply a “memory‐aware” approach to calculate sentence‐level semantic relevance based on convolution operations. The sentence‐level metrics developed are tested and compared to validate whether they can predict the reading speed of sentences, and, further, we explore how sentence‐level metrics affect the way humans process and comprehend sentences as a whole across languages. The results show that sentence‐level metrics are highly capable of predicting sentence reading speed. Our results also indicate that these computational sentence‐level metrics are exceptionally effective at predicting and explaining the processing difficulties encountered by readers in processing sentences as a whole across a variety of languages. The proposed sentence‐level metrics offer significant interpretability and achieve high accuracy in predicting human sentence reading speed, as they capture unique aspects of comprehension difficulty beyond word‐level measures. These metrics serve as valuable computational tools for investigating human sentence processing and advancing our understanding of naturalistic reading. Their strong performance and generalization capabilities highlight their potential to drive progress at the intersection of LLMs and cognitive science.
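The abstract names a chain-rule method for sentence surprisal without giving its formula in this summary view. As a minimal sketch, assuming the standard definition (the surprisal of a sentence is the sum of the surprisals of its tokens, each conditioned on the preceding tokens), the computation could look like the following. The function name `sentence_surprisal`, the choice of bits as the unit, and the probability values are illustrative assumptions, not taken from the paper; in practice the conditional probabilities would come from a language model.

```python
import math
from typing import Sequence


def sentence_surprisal(token_probs: Sequence[float]) -> float:
    """Return the summed per-token surprisal of a sentence, in bits.

    token_probs[i] is assumed to be P(w_i | w_1 .. w_{i-1}), i.e. the
    conditional probability a language model assigns to token i given
    the tokens before it.
    """
    if any(p <= 0.0 or p > 1.0 for p in token_probs):
        raise ValueError("each probability must lie in (0, 1]")
    # Chain rule: -log2 P(w_1 .. w_n) = sum_i -log2 P(w_i | w_<i)
    return sum(-math.log2(p) for p in token_probs)


# Hypothetical conditional probabilities for a three-token sentence
# (made-up values, not model output or data from the paper).
probs = [0.5, 0.25, 0.125]
total_bits = sentence_surprisal(probs)

# The same quantity computed from the joint probability directly.
joint_bits = -math.log2(math.prod(probs))
```

Summing log-probabilities rather than multiplying raw probabilities avoids numerical underflow on long sentences, which is why the summed form is the usual way to apply the chain rule in code.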

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12281087/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12281087/full.md

## References

89 references — full list in the complete paper: https://tomesphere.com/paper/PMC12281087/full.md

---
Source: https://tomesphere.com/paper/PMC12281087