Multilingual Language Models Predict Human Reading Behavior

Nora Hollenstein; Federico Pirovano; Ce Zhang; Lena J\"ager; Lisa; Beinborn

arXiv:2104.05433·cs.CL·April 13, 2021

Multilingual Language Models Predict Human Reading Behavior

Nora Hollenstein, Federico Pirovano, Ce Zhang, Lena J\"ager, Lisa, Beinborn

PDF

1 Repo

TL;DR

This paper investigates whether large multilingual transformer models can accurately predict human reading behavior across multiple languages, revealing their implicit encoding of language importance similar to human processing.

Contribution

It demonstrates that multilingual transformer models like BERT and XLM can effectively predict human reading patterns across different languages, highlighting their potential to model human sentence processing.

Findings

01

BERT and XLM models predict eye tracking features accurately

02

Models show cross-language generalization in reading behavior prediction

03

Transformer models encode language importance akin to human cognition

Abstract

We analyze if large language models are able to predict patterns of human reading behavior. We compare the performance of language-specific and multilingual pretrained transformer models to predict reading time measures reflecting natural human sentence processing on Dutch, English, German, and Russian texts. This results in accurate models of human reading behavior, which indicates that transformer models implicitly encode relative importance in language in a way that is comparable to human processing mechanisms. We find that BERT and XLM models successfully predict a range of eye tracking features. In a series of experiments, we analyze the cross-domain and cross-language abilities of these models and show how they reflect human sentence processing.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DS3Lab/multilingual-gaze
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Weight Decay · Linear Warmup With Linear Decay · WordPiece · Byte Pair Encoding · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · BERT