Benchmarking human visual search computational models in natural scenes:   models comparison and reference datasets

F. Travi (1); G. Ruarte (1); G. Bujia (1); J. E. Kamienkowski (1,2); ((1) Laboratorio de Inteligencia Artificial Aplicada; Instituto de Ciencias; de la Computaci\'on; Universidad de Buenos Aires - CONICET (2) Maestr\'ia de; Explotaci\'on de Datos y Descubrimiento del Conocimiento; Universidad de; Buenos Aires; Argentina)

arXiv:2112.05808·cs.CV·December 14, 2021·1 cites

Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets

F. Travi (1), G. Ruarte (1), G. Bujia (1), J. E. Kamienkowski (1,2), ((1) Laboratorio de Inteligencia Artificial Aplicada, Instituto de Ciencias, de la Computaci\'on, Universidad de Buenos Aires - CONICET (2) Maestr\'ia de

PDF

Open Access 1 Repo

TL;DR

This paper evaluates and compares state-of-the-art human visual search models in natural scenes using standardized datasets and metrics, proposing an improved hybrid model and emphasizing the need for benchmarking resources.

Contribution

It provides a comprehensive benchmarking framework for visual search models, introduces an improved hybrid model, and highlights the importance of standardized datasets and metrics.

Findings

01

Current models show limitations in generalization.

02

Combining Bayesian and neural network approaches improves performance.

03

Benchmarking datasets are crucial for advancing models.

Abstract

Visual search is an essential part of almost any everyday human goal-directed interaction with the environment. Nowadays, several algorithms are able to predict gaze positions during simple observation, but few models attempt to simulate human behavior during visual search in natural scenes. Furthermore, these models vary widely in their design and exhibit differences in the datasets and metrics with which they were evaluated. Thus, there is a need for a reference point, on which each model can be tested and from where potential improvements can be derived. In the present work, we select publicly available state-of-the-art visual search models in natural scenes and evaluate them on different datasets, employing the same metrics to estimate their efficiency and similarity with human subjects. In particular, we propose an improvement to the Ideal Bayesian Searcher through a combination…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fermint/visualsearchbenchmark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Gaze Tracking and Assistive Technology · Visual perception and processing mechanisms