SID4VAM: A Benchmark Dataset with Synthetic Images for Visual Attention   Modeling

David Berga; Xos\'e R. Fdez-Vidal; Xavier Otazu; Xos\'e M. Pardo

arXiv:1910.13066·cs.CV·October 30, 2019

SID4VAM: A Benchmark Dataset with Synthetic Images for Visual Attention Modeling

David Berga, Xos\'e R. Fdez-Vidal, Xavier Otazu, Xos\'e M. Pardo

PDF

2 Repos

TL;DR

This paper introduces SID4VAM, a synthetic image dataset for benchmarking visual saliency models, revealing that models inspired by spectral/Fourier features outperform others and align better with human perception.

Contribution

The paper presents SID4VAM, a novel synthetic dataset for evaluating saliency models, and demonstrates the superior performance of spectral/Fourier inspired models on this dataset.

Findings

01

Spectral/Fourier inspired models outperform others in saliency metrics.

02

Models perform poorly on synthetic pattern images compared to natural images.

03

Spectral/Fourier models are more consistent with human psychophysics.

Abstract

A benchmark of saliency models performance with a synthetic image dataset is provided. Model performance is evaluated through saliency metrics as well as the influence of model inspiration and consistency with human psychophysics. SID4VAM is composed of 230 synthetic images, with known salient regions. Images were generated with 15 distinct types of low-level features (e.g. orientation, brightness, color, size...) with a target-distractor pop-out type of synthetic patterns. We have used Free-Viewing and Visual Search task instructions and 7 feature contrasts for each feature category. Our study reveals that state-of-the-art Deep Learning saliency models do not perform well with synthetic pattern images, instead, models with Spectral/Fourier inspiration outperform others in saliency metrics and are more consistent with human psychophysical experimentation. This study proposes a new way…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.