Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning
Zhibo Yang, Lihan Huang, Yupei Chen, Zijun Wei, Seoyoung Ahn, Gregory, Zelinsky, Dimitris Samaras, Minh Hoai

TL;DR
This paper introduces an inverse reinforcement learning model to predict goal-directed human visual search behavior, utilizing a new large dataset and outperforming baselines in scanpath prediction and understanding object prioritization.
Contribution
The study presents the first IRL model for visual search, a new dataset COCO-Search18, and demonstrates improved prediction of goal-directed gaze behavior over existing models.
Findings
IRL model outperforms baselines in scanpath prediction
Reward maps reveal target-dependent object prioritization
Created the largest dataset of goal-directed search fixations
Abstract
Being able to predict human gaze behavior has obvious importance for behavioral vision and for computer vision applications. Most models have mainly focused on predicting free-viewing behavior using saliency maps, but these predictions do not generalize to goal-directed behavior, such as when a person searches for a visual target object. We propose the first inverse reinforcement learning (IRL) model to learn the internal reward function and policy used by humans during visual search. The viewer's internal belief states were modeled as dynamic contextual belief maps of object locations. These maps were learned by IRL and then used to predict behavioral scanpaths for multiple target categories. To train and evaluate our IRL model we created COCO-Search18, which is now the largest dataset of high-quality search fixations in existence. COCO-Search18 has 10 participants searching for each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Predicting Goal-Directed Human Attention Using Inverse Reinforcement Learning· youtube
Taxonomy
TopicsVisual Attention and Saliency Detection · Olfactory and Sensory Function Studies · Face Recognition and Perception
