A Psychophysically Oriented Saliency Map Prediction Model

Qiang Li

arXiv:2011.04076·cs.CV·November 5, 2024

A Psychophysically Oriented Saliency Map Prediction Model

Qiang Li

PDF

Open Access

TL;DR

This paper introduces WECSF, a psychophysically inspired saliency prediction model that closely mimics human visual processing and outperforms existing models on various datasets, offering insights into primate vision mechanisms.

Contribution

The paper presents a novel saliency prediction architecture based on human visual cortex principles, integrating opponent color channels, wavelet transforms, and contrast sensitivity for improved accuracy.

Findings

01

WECSF outperforms state-of-the-art models on multiple datasets.

02

Fourier and spectral models excel on psychophysical synthetic images.

03

Deep neural networks require specific architectures for better psychophysical image prediction.

Abstract

Visual attention is one of the most significant characteristics for selecting and understanding the outside redundancy world. The human vision system cannot process all information simultaneously due to the visual information bottleneck. In order to reduce the redundant input of visual information, the human visual system mainly focuses on dominant parts of scenes. This is commonly known as visual saliency map prediction. This paper proposed a new psychophysical saliency prediction architecture, WECSF, inspired by multi-channel model of visual cortex functioning in humans. The model consists of opponent color channels, wavelet transform, wavelet energy map, and contrast sensitivity function for extracting low-level image features and providing a maximum approximation to the human visual system. The proposed model is evaluated using several datasets, including the MIT1003, MIT300,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Advanced Image Fusion Techniques