Leveraging recent advances in Pre-Trained Language Models   forEye-Tracking Prediction

Varun Madhavan; Aditya Girish Pawate; Shraman Pal; Abhranil Chandra

arXiv:2110.04475·cs.CL·October 12, 2021

Leveraging recent advances in Pre-Trained Language Models forEye-Tracking Prediction

Varun Madhavan, Aditya Girish Pawate, Shraman Pal, Abhranil Chandra

PDF

Open Access 1 Repo

TL;DR

This paper explores using pre-trained language models and eye-tracking data to predict gaze features during reading, proposing a novel hybrid neural architecture that improves prediction accuracy.

Contribution

It introduces a new neural network architecture combining RoBERTa and custom features for eye-tracking prediction, leveraging recent advances in language models.

Findings

01

The hybrid model outperforms individual models in MAE and R2 scores.

02

Feature engineering combined with pre-trained models enhances prediction accuracy.

03

The approach demonstrates the potential of language models in cognitive and linguistic tasks.

Abstract

Cognitively inspired Natural Language Pro-cessing uses human-derived behavioral datalike eye-tracking data, which reflect the seman-tic representations of language in the humanbrain to augment the neural nets to solve arange of tasks spanning syntax and semanticswith the aim of teaching machines about lan-guage processing mechanisms. In this paper,we use the ZuCo 1.0 and ZuCo 2.0 dataset con-taining the eye-gaze features to explore differ-ent linguistic models to directly predict thesegaze features for each word with respect to itssentence. We tried different neural networkmodels with the words as inputs to predict thetargets. And after lots of experimentation andfeature engineering finally devised a novel ar-chitecture consisting of RoBERTa Token Clas-sifier with a dense layer on top for languagemodeling and a stand-alone model consistingof dense layers followed by a transformer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

abhra-nilIITKgp/CMCL-2021-Predicting-human-reading-patterns
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning in Bioinformatics

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Adam · Softmax · Dropout · Dense Connections · Layer Normalization · Attention Dropout