Enhancing RLHF with Human Gaze Modeling

Karim Galliamov; Ivan Titov; and Ilya Pershin

arXiv:2507.09016·cs.LG·July 17, 2025

Enhancing RLHF with Human Gaze Modeling

Karim Galliamov, Ivan Titov, and Ilya Pershin

PDF

Open Access 1 Video

TL;DR

This paper introduces methods that incorporate human gaze modeling into RLHF to improve training efficiency and reduce computational costs while maintaining or enhancing model performance.

Contribution

It presents novel gaze-aware reward models and gaze-based reward distribution techniques that leverage human gaze data to enhance RLHF.

Findings

01

Gaze-informed RLHF converges faster than traditional methods.

02

Gaze-based rewards maintain or slightly improve model performance.

03

Using gaze data reduces computational costs during training.

Abstract

Reinforcement Learning from Human Feedback (RLHF) aligns language models with human preferences but is computationally expensive. We explore two approaches that leverage human gaze modeling to enhance RLHF: (1) gaze-aware reward models and (2) gaze-based distribution of sparse rewards at token level. Our experiments demonstate that gaze-informed RLHF achieves faster convergence while maintaining or slightly improving performance, thus, reducing computational costs during policy optimization. These results show that human gaze provides a valuable and underused signal for policy optimization, pointing to a promising direction for improving RLHF efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Enhancing RLHF with Human Gaze Modeling· underline

Taxonomy

TopicsGaze Tracking and Assistive Technology