On the Predictive Power of Neural Language Models for Human Real-Time Comprehension Behavior
Ethan Gotlieb Wilcox, Jon Gauthier, Jennifer Hu, Peng Qian, Roger, Levy

TL;DR
This study evaluates various neural language models to determine how well their predictions of next words align with human reading times, revealing architecture-dependent differences and the limited role of syntactic knowledge in predicting comprehension behavior.
Contribution
It systematically compares multiple neural language models' ability to predict human reading behavior and examines the relationship between model expectations, architecture, and syntactic knowledge.
Findings
Deep Transformer and n-gram models outperform others in psychometric predictive power.
Better next-word expectations correlate with improved prediction of human reading times.
Syntactic knowledge does not significantly enhance predictive power beyond perplexity levels.
Abstract
Human reading behavior is tuned to the statistics of natural language: the time it takes human subjects to read a word can be predicted from estimates of the word's probability in context. However, it remains an open question what computational architecture best characterizes the expectations deployed in real time by humans that determine the behavioral signatures of reading. Here we test over two dozen models, independently manipulating computational architecture and training dataset size, on how well their next-word expectations predict human reading time behavior on naturalistic text corpora. We find that across model architectures and training dataset sizes the relationship between word log-probability and reading time is (near-)linear. We next evaluate how features of these models determine their psychometric predictive power, or ability to predict human reading behavior. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Neurobiology of Language and Bilingualism
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Multi-Head Attention · Adam · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout
