GazBy: Gaze-Based BERT Model to Incorporate Human Attention in Neural Information Retrieval
Sibo Dong, Justin Goldstein, Grace Hui Yang

TL;DR
This paper introduces GazBy, a lightweight BERT-based model that incorporates human gaze signals to enhance neural information retrieval, demonstrating promising results on TREC datasets and highlighting potential for gaze data in future VR/AR search applications.
Contribution
The paper presents GazBy, a novel model integrating human gaze signals into transformer-based IR models, pioneering the use of gaze data in neural retrieval systems.
Findings
GazBy improves relevance prediction over baseline models.
Effective integration points for gaze signals are identified.
Gaze data shows potential for enhancing neural IR in VR/AR contexts.
Abstract
This paper is interested in investigating whether human gaze signals can be leveraged to improve state-of-the-art search engine performance and how to incorporate this new input signal marked by human attention into existing neural retrieval models. In this paper, we propose GazBy ({\bf Gaz}e-based {\bf B}ert model for document relevanc{\bf y}), a light-weight joint model that integrates human gaze fixation estimation into transformer models to predict document relevance, incorporating more nuanced information about cognitive processing into information retrieval (IR). We evaluate our model on the Text Retrieval Conference (TREC) Deep Learning (DL) 2019 and 2020 Tracks. Our experiments show encouraging results and illustrate the effective and ineffective entry points for using human gaze to help with transformer-based neural retrievers. With the rise of virtual reality (VR) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
