Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets
F. Travi (1), G. Ruarte (1), G. Bujia (1), J. E. Kamienkowski (1,2), ((1) Laboratorio de Inteligencia Artificial Aplicada, Instituto de Ciencias, de la Computaci\'on, Universidad de Buenos Aires - CONICET (2) Maestr\'ia de

TL;DR
This paper evaluates and compares state-of-the-art human visual search models in natural scenes using standardized datasets and metrics, proposing an improved hybrid model and emphasizing the need for benchmarking resources.
Contribution
It provides a comprehensive benchmarking framework for visual search models, introduces an improved hybrid model, and highlights the importance of standardized datasets and metrics.
Findings
Current models show limitations in generalization.
Combining Bayesian and neural network approaches improves performance.
Benchmarking datasets are crucial for advancing models.
Abstract
Visual search is an essential part of almost any everyday human goal-directed interaction with the environment. Nowadays, several algorithms are able to predict gaze positions during simple observation, but few models attempt to simulate human behavior during visual search in natural scenes. Furthermore, these models vary widely in their design and exhibit differences in the datasets and metrics with which they were evaluated. Thus, there is a need for a reference point, on which each model can be tested and from where potential improvements can be derived. In the present work, we select publicly available state-of-the-art visual search models in natural scenes and evaluate them on different datasets, employing the same metrics to estimate their efficiency and similarity with human subjects. In particular, we propose an improvement to the Ideal Bayesian Searcher through a combination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Gaze Tracking and Assistive Technology · Visual perception and processing mechanisms
