Language Models Explain Word Reading Times Better Than Empirical Predictability
Markus J. Hofmann, Steffen Remus, Chris Biemann, Ralph Radach, Lars, Kuchinke

TL;DR
This study demonstrates that modern probabilistic language models, especially n-gram and RNNs, better predict word reading times during reading than traditional cloze completion probability, highlighting their effectiveness in capturing syntactic and semantic influences.
Contribution
The paper compares traditional CCP with advanced language models, showing that probabilistic models provide deeper insights into lexical retrieval and reading behavior.
Findings
N-gram and RNN models outperform CCP in predicting reading times.
Probabilistic models show stronger correlations with eye-movement measures.
Long-range semantic models like topic models are less predictive of reading times.
Abstract
Though there is a strong consensus that word length and frequency are the most important single-word features determining visual-orthographic access to the mental lexicon, there is less agreement as how to best capture syntactic and semantic factors. The traditional approach in cognitive reading research assumes that word predictability from sentence context is best captured by cloze completion probability (CCP) derived from human performance data. We review recent research suggesting that probabilistic language models provide deeper explanations for syntactic and semantic effects than CCP. Then we compare CCP with (1) Symbolic n-gram models consolidate syntactic and semantic short-range relations by computing the probability of a word to occur, given two preceding words. (2) Topic models rely on subsymbolic representations to capture long-range semantic similarity by word co-occurrence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
