On the Predictive Power of Neural Language Models for Human Real-Time   Comprehension Behavior

Ethan Gotlieb Wilcox; Jon Gauthier; Jennifer Hu; Peng Qian; Roger; Levy

arXiv:2006.01912·cs.CL·June 4, 2020·107 cites

On the Predictive Power of Neural Language Models for Human Real-Time Comprehension Behavior

Ethan Gotlieb Wilcox, Jon Gauthier, Jennifer Hu, Peng Qian, Roger, Levy

PDF

Open Access 1 Repo

TL;DR

This study evaluates various neural language models to determine how well their predictions of next words align with human reading times, revealing architecture-dependent differences and the limited role of syntactic knowledge in predicting comprehension behavior.

Contribution

It systematically compares multiple neural language models' ability to predict human reading behavior and examines the relationship between model expectations, architecture, and syntactic knowledge.

Findings

01

Deep Transformer and n-gram models outperform others in psychometric predictive power.

02

Better next-word expectations correlate with improved prediction of human reading times.

03

Syntactic knowledge does not significantly enhance predictive power beyond perplexity levels.

Abstract

Human reading behavior is tuned to the statistics of natural language: the time it takes human subjects to read a word can be predicted from estimates of the word's probability in context. However, it remains an open question what computational architecture best characterizes the expectations deployed in real time by humans that determine the behavioral signatures of reading. Here we test over two dozen models, independently manipulating computational architecture and training dataset size, on how well their next-word expectations predict human reading time behavior on naturalistic text corpora. We find that across model architectures and training dataset sizes the relationship between word log-probability and reading time is (near-)linear. We next evaluate how features of these models determine their psychometric predictive power, or ability to predict human reading behavior. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wilcoxeg/neural-networks-read-times
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Neurobiology of Language and Bilingualism

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Multi-Head Attention · Adam · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout