State-of-the-Art in Human Scanpath Prediction
Matthias K\"ummerer, Matthias Bethge

TL;DR
This paper reviews and evaluates current models predicting human eye movement scanpaths, proposing a new evaluation method that aligns better with biological processes and offers detailed insights into model performance.
Contribution
It introduces a novel evaluation approach for scanpath prediction models based on fixation-by-fixation accuracy, facilitating more biologically relevant assessments.
Findings
Models vary significantly in predictive accuracy.
The new evaluation method reveals specific failure points of existing models.
Benchmark datasets enable standardized comparison across models.
Abstract
The last years have seen a surge in models predicting the scanpaths of fixations made by humans when viewing images. However, the field is lacking a principled comparison of those models with respect to their predictive power. In the past, models have usually been evaluated based on comparing human scanpaths to scanpaths generated from the model. Here, instead we evaluate models based on how well they predict each fixation in a scanpath given the previous scanpath history. This makes model evaluation closely aligned with the biological processes thought to underly scanpath generation and allows to apply established saliency metrics like AUC and NSS in an intuitive and interpretable way. We evaluate many existing models of scanpath prediction on the datasets MIT1003, MIT300, CAT2000 train and CAT200 test, for the first time giving a detailed picture of the current state of the art of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face Recognition and Perception · Advanced Neural Network Applications
