Leveraging recent advances in Pre-Trained Language Models forEye-Tracking Prediction
Varun Madhavan, Aditya Girish Pawate, Shraman Pal, Abhranil Chandra

TL;DR
This paper explores using pre-trained language models and eye-tracking data to predict gaze features during reading, proposing a novel hybrid neural architecture that improves prediction accuracy.
Contribution
It introduces a new neural network architecture combining RoBERTa and custom features for eye-tracking prediction, leveraging recent advances in language models.
Findings
The hybrid model outperforms individual models in MAE and R2 scores.
Feature engineering combined with pre-trained models enhances prediction accuracy.
The approach demonstrates the potential of language models in cognitive and linguistic tasks.
Abstract
Cognitively inspired Natural Language Pro-cessing uses human-derived behavioral datalike eye-tracking data, which reflect the seman-tic representations of language in the humanbrain to augment the neural nets to solve arange of tasks spanning syntax and semanticswith the aim of teaching machines about lan-guage processing mechanisms. In this paper,we use the ZuCo 1.0 and ZuCo 2.0 dataset con-taining the eye-gaze features to explore differ-ent linguistic models to directly predict thesegaze features for each word with respect to itssentence. We tried different neural networkmodels with the words as inputs to predict thetargets. And after lots of experimentation andfeature engineering finally devised a novel ar-chitecture consisting of RoBERTa Token Clas-sifier with a dense layer on top for languagemodeling and a stand-alone model consistingof dense layers followed by a transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning in Bioinformatics
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Adam · Softmax · Dropout · Dense Connections · Layer Normalization · Attention Dropout
