A Psychophysically Oriented Saliency Map Prediction Model
Qiang Li

TL;DR
This paper introduces WECSF, a psychophysically inspired saliency prediction model that closely mimics human visual processing and outperforms existing models on various datasets, offering insights into primate vision mechanisms.
Contribution
The paper presents a novel saliency prediction architecture based on human visual cortex principles, integrating opponent color channels, wavelet transforms, and contrast sensitivity for improved accuracy.
Findings
WECSF outperforms state-of-the-art models on multiple datasets.
Fourier and spectral models excel on psychophysical synthetic images.
Deep neural networks require specific architectures for better psychophysical image prediction.
Abstract
Visual attention is one of the most significant characteristics for selecting and understanding the outside redundancy world. The human vision system cannot process all information simultaneously due to the visual information bottleneck. In order to reduce the redundant input of visual information, the human visual system mainly focuses on dominant parts of scenes. This is commonly known as visual saliency map prediction. This paper proposed a new psychophysical saliency prediction architecture, WECSF, inspired by multi-channel model of visual cortex functioning in humans. The model consists of opponent color channels, wavelet transform, wavelet energy map, and contrast sensitivity function for extracting low-level image features and providing a maximum approximation to the human visual system. The proposed model is evaluated using several datasets, including the MIT1003, MIT300,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Advanced Image Fusion Techniques
