Human Scanpath Prediction in Target-Present Visual Search with Semantic-Foveal Bayesian Attention
Jo\~ao Luzio, Alexandre Bernardino, Plinio Moreno

TL;DR
This paper introduces SemBA-FAST, a novel top-down Bayesian attention model that predicts human visual scanpaths during target-present search tasks by integrating deep object detection with semantic fusion, outperforming existing models.
Contribution
The work presents SemBA-FAST, a new semantic-foveal Bayesian framework that improves human scanpath prediction accuracy in visual search tasks using probabilistic semantic integration.
Findings
SemBA-FAST closely matches human scanpaths on COCO-Search18.
It outperforms baseline and other top-down models.
In some cases, it rivals scanpath-informed models.
Abstract
In goal-directed visual tasks, human perception is guided by both top-down and bottom-up cues. At the same time, foveal vision plays a crucial role in directing attention efficiently. Modern research on bio-inspired computational attention models has taken advantage of advancements in deep learning by utilizing human scanpath data to achieve new state-of-the-art performance. In this work, we assess the performance of SemBA-FAST, i.e. Semantic-based Bayesian Attention for Foveal Active visual Search Tasks, a top-down framework designed for predicting human visual attention in target-present visual search. SemBA-FAST integrates deep object detection with a probabilistic semantic fusion mechanism to generate attention maps dynamically, leveraging pre-trained detectors and artificial foveation to update top-down knowledge and improve fixation prediction sequentially. We evaluate SemBA-FAST…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
