Measuring the Impact of (Psycho-)Linguistic and Readability Features and   Their Spill Over Effects on the Prediction of Eye Movement Patterns

Daniel Wiechmann; Yu Qiao; Elma Kerz; Justus Mattern

arXiv:2203.08085·cs.CL·March 16, 2022

Measuring the Impact of (Psycho-)Linguistic and Readability Features and Their Spill Over Effects on the Prediction of Eye Movement Patterns

Daniel Wiechmann, Yu Qiao, Elma Kerz, Justus Mattern

PDF

TL;DR

This study investigates how various linguistic and readability features, along with transformer model architectures, influence the prediction of eye movement patterns during natural reading, revealing the importance of feature selection and model design.

Contribution

It systematically evaluates the impact of diverse text features and transformer architectures on eye-tracking prediction, highlighting their combined effects.

Findings

01

Both features and model architecture significantly affect prediction accuracy.

02

Certain linguistic features are more influential in predicting eye movements.

03

Transformer models' performance varies with feature inclusion.

Abstract

There is a growing interest in the combined use of NLP and machine learning methods to predict gaze patterns during naturalistic reading. While promising results have been obtained through the use of transformer-based language models, little work has been undertaken to relate the performance of such models to general text characteristics. In this paper we report on experiments with two eye-tracking corpora of naturalistic reading and two language models (BERT and GPT-2). In all experiments, we test effects of a broad spectrum of features for predicting human reading behavior that fall into five categories (syntactic complexity, lexical richness, register-based multiword combinations, readability and psycholinguistic word properties). Our experiments show that both the features included and the architecture of the transformer-based language models play a role in predicting multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.