CAGNet: Content-Aware Guidance for Salient Object Detection

Sina Mohammadi; Mehrdad Noori; Ali Bahri; Sina Ghofrani Majelan,; Mohammad Havaei

arXiv:1911.13168·cs.CV·April 7, 2020

CAGNet: Content-Aware Guidance for Salient Object Detection

Sina Mohammadi, Mehrdad Noori, Ali Bahri, Sina Ghofrani Majelan,, Mohammad Havaei

PDF

3 Repos

TL;DR

CAGNet introduces a content-aware guidance framework with multi-scale feature extraction that significantly improves salient object detection accuracy, robustness, and speed across challenging scenarios and datasets.

Contribution

The paper proposes a novel Feature Guide Network with multi-scale feature extraction and a new loss function, enhancing salient object detection with fewer parameters and real-time performance.

Findings

01

Achieves state-of-the-art results on five challenging datasets.

02

Operates at 28 FPS with fewer parameters than existing methods.

03

Effectively handles complex scenarios with salient-like backgrounds.

Abstract

Beneficial from Fully Convolutional Neural Networks (FCNs), saliency detection methods have achieved promising results. However, it is still challenging to learn effective features for detecting salient objects in complicated scenarios, in which i) non-salient regions may have "salient-like" appearance; ii) the salient objects may have different-looking regions. To handle these complex scenarios, we propose a Feature Guide Network which exploits the nature of low-level and high-level features to i) make foreground and background regions more distinct and suppress the non-salient regions which have "salient-like" appearance; ii) assign foreground label to different-looking salient regions. Furthermore, we utilize a Multi-scale Feature Extraction Module (MFEM) for each level of abstraction to obtain multi-scale contextual information. Finally, we design a loss function which outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings