Collateral facilitation in humans and language models

James A. Michaelov; Benjamin K. Bergen

arXiv:2211.05198·cs.CL·November 11, 2022

Collateral facilitation in humans and language models

James A. Michaelov, Benjamin K. Bergen

PDF

Open Access 1 Repo

TL;DR

This study compares human and language model predictions during language comprehension, revealing that both exhibit similar facilitation effects for anomalous words related to context, which informs our understanding of language processing.

Contribution

The paper demonstrates that contemporary transformer language models show similar collateral facilitation effects as humans when processing anomalous words in context.

Findings

01

Language models exhibit facilitation for anomalous words related to context.

02

Humans and models show similar prediction patterns during language comprehension.

03

Implications for understanding human and AI language processing.

Abstract

Are the predictions of humans and language models affected by similar things? Research suggests that while comprehending language, humans make predictions about upcoming words, with more predictable words being processed more easily. However, evidence also shows that humans display a similar processing advantage for highly anomalous words when these words are semantically related to the preceding context or to the most probable continuation. Using stimuli from 3 psycholinguistic experiments, we find that this is also almost always also the case for 8 contemporary transformer language models (BERT, ALBERT, RoBERTa, XLM-R, GPT-2, GPT-Neo, GPT-J, and XGLM). We then discuss the implications of this phenomenon for our understanding of both human language comprehension and the predictions made by language models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jmichaelov/collateral-facilitation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Byte Pair Encoding · Dropout · Linear Warmup With Cosine Annealing · Attention Dropout · Weight Decay · Discriminative Fine-Tuning