DC-Net: Divide-and-Conquer for Salient Object Detection

Jiayi Zhu; Xuebin Qin; Abdulmotaleb Elsaddik

arXiv:2305.14955·cs.CV·January 11, 2024·1 cites

DC-Net: Divide-and-Conquer for Salient Object Detection

Jiayi Zhu, Xuebin Qin, Abdulmotaleb Elsaddik

PDF

Open Access 1 Repo

TL;DR

DC-Net introduces a divide-and-conquer approach with specialized encoders and a novel decoder to improve salient object detection, achieving high efficiency and competitive accuracy across multiple datasets.

Contribution

The paper proposes a novel divide-and-conquer network architecture with parallel acceleration for efficient and accurate salient object detection.

Findings

01

Achieves 60 FPS and 55 FPS on LR-SOD and HR-SOD datasets.

02

Effectively captures multi-scale features with ResASPP$^{2}$ modules.

03

Outperforms existing methods on multiple benchmark datasets.

Abstract

In this paper, we introduce Divide-and-Conquer into the salient object detection (SOD) task to enable the model to learn prior knowledge that is for predicting the saliency map. We design a novel network, Divide-and-Conquer Network (DC-Net) which uses two encoders to solve different subtasks that are conducive to predicting the final saliency map, here is to predict the edge maps with width 4 and location maps of salient objects and then aggregate the feature maps with different semantic information into the decoder to predict the final saliency map. The decoder of DC-Net consists of our newly designed two-level Residual nested-ASPP (ResASPP $^{2}$ ) modules, which have the ability to capture a large number of different scale features with a small number of convolution operations and have the advantages of maintaining high resolution all the time and being able to obtain a large and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

piggyjerry/dc-net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution