LAST at CMCL 2021 Shared Task: Predicting Gaze Data During Reading with   a Gradient Boosting Decision Tree Approach

Yves Bestgen

arXiv:2104.13043·cs.CL·April 28, 2021

LAST at CMCL 2021 Shared Task: Predicting Gaze Data During Reading with a Gradient Boosting Decision Tree Approach

Yves Bestgen

PDF

TL;DR

This paper presents a LightGBM-based model that predicts eye-tracking data during reading, outperforming deep learning systems in the 2021 CMCL Shared Task by using lexical, psychometric, and bigram features.

Contribution

The paper introduces a gradient boosting decision tree approach that effectively predicts gaze data, surpassing deep learning methods in the shared task.

Findings

01

Achieved top performance on two eye-tracking measures.

02

Outperformed all deep-learning based systems in the challenge.

03

Ranked first on the official challenge criterion.

Abstract

A LightGBM model fed with target word lexical characteristics and features obtained from word frequency lists, psychometric data and bigram association measures has been optimized for the 2021 CMCL Shared Task on Eye-Tracking Data Prediction. It obtained the best performance of all teams on two of the five eye-tracking measures to predict, allowing it to rank first on the official challenge criterion and to outperform all deep-learning based systems participating in the challenge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsCrossmodal Contrastive Learning