Visual Saliency Based on Scale-Space Analysis in the Frequency Domain
Jian Li, Martin Levine, Xiangjing An, Xin Xu, Hangen He

TL;DR
This paper introduces a novel frequency domain approach to visual saliency detection using scale-space analysis of the amplitude spectrum, effectively predicting human fixation points and highlighting salient regions of various sizes.
Contribution
It presents a new bottom-up saliency detection paradigm based on amplitude spectrum filtering and phase reconstruction, capable of detecting multiple scales of salient regions.
Findings
Predicts human fixation data accurately.
Highlights both small and large salient regions.
Inhibits distractors in cluttered images.
Abstract
We address the issue of visual saliency from three perspectives. First, we consider saliency detection as a frequency domain analysis problem. Second, we achieve this by employing the concept of {\it non-saliency}. Third, we simultaneously consider the detection of salient regions of different size. The paper proposes a new bottom-up paradigm for detecting visual saliency, characterized by a scale-space analysis of the amplitude spectrum of natural images. We show that the convolution of the {\it image amplitude spectrum} with a low-pass Gaussian kernel of an appropriate scale is equivalent to such an image saliency detector. The saliency map is obtained by reconstructing the 2-D signal using the original phase and the amplitude spectrum, filtered at a scale selected by minimizing saliency map entropy. A Hypercomplex Fourier Transform performs the analysis in the frequency domain. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
