PAANet:Visual Perception based Four-stage Framework for Salient Object   Detection using High-order Contrast Operator

Yanbo Yuan; Hua Zhong; Haixiong Li; Xiao cheng; Linmei Xia

arXiv:2211.08724·cs.CV·November 17, 2022

PAANet:Visual Perception based Four-stage Framework for Salient Object Detection using High-order Contrast Operator

Yanbo Yuan, Hua Zhong, Haixiong Li, Xiao cheng, Linmei Xia

PDF

Open Access

TL;DR

PAANet introduces a four-stage saliency detection framework inspired by human vision, utilizing a novel high-order contrast operator for improved semantic feature extraction and efficiency, achieving state-of-the-art results.

Contribution

The paper proposes a four-stage framework combining pre-attentive and attentive processes, with a novel contrast operator for enhanced saliency detection in complex scenes.

Findings

01

Outperforms state-of-the-art methods on five datasets

02

Efficient training due to fixed backbone in GFE stage

03

High-order contrast operator improves semantic feature extraction

Abstract

It is believed that human vision system (HVS) consists of pre-attentive process and attention process when performing salient object detection (SOD). Based on this fact, we propose a four-stage framework for SOD, in which the first two stages match the \textbf{P}re-\textbf{A}ttentive process consisting of general feature extraction (GFE) and feature preprocessing (FP), and the last two stages are corresponding to \textbf{A}ttention process containing saliency feature extraction (SFE) and the feature aggregation (FA), namely \textbf{PAANet}. According to the pre-attentive process, the GFE stage applies the fully-trained backbone and needs no further finetuning for different datasets. This modification can greatly increase the training speed. The FP stage plays the role of finetuning but works more efficiently because of its simpler structure and fewer parameters. Moreover, in SFE stage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Olfactory and Sensory Function Studies

MethodsConvolution