Predicting Visual Attention and Distraction During Visual Search Using Convolutional Neural Networks
Manoosh Samiei, James J. Clark

TL;DR
This paper introduces two deep learning models to predict human visual attention and distraction during goal-oriented visual search tasks, achieving high accuracy and providing new segmentation annotations.
Contribution
It presents a novel adaptation of saliency models and a segmentation-based approach for modeling attention and distraction in visual search, with publicly available code and annotations.
Findings
High accuracy in saliency prediction (AUC-Judd=0.95)
Effective object-based distractor and target detection (F1-score=0.64)
Provision of new segmentation annotations for COCO-Search18
Abstract
Most studies in computational modeling of visual attention encompass task-free observation of images. Free-viewing saliency considers limited scenarios of daily life. Most visual activities are goal-oriented and demand a great amount of top-down attention control. Visual search task demands more top-down control of attention, compared to free-viewing. In this paper, we present two approaches to model visual attention and distraction of observers during visual search. Our first approach adapts a light-weight free-viewing saliency model to predict eye fixation density maps of human observers over pixels of search images, using a two-stream convolutional encoder-decoder network, trained and evaluated on COCO-Search18 dataset. This method predicts which locations are more distracting when searching for a particular target. Our network achieves good results on standard saliency metrics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Gaze Tracking and Assistive Technology
