DC-Net: Divide-and-Conquer for Salient Object Detection
Jiayi Zhu, Xuebin Qin, Abdulmotaleb Elsaddik

TL;DR
DC-Net introduces a divide-and-conquer approach with specialized encoders and a novel decoder to improve salient object detection, achieving high efficiency and competitive accuracy across multiple datasets.
Contribution
The paper proposes a novel divide-and-conquer network architecture with parallel acceleration for efficient and accurate salient object detection.
Findings
Achieves 60 FPS and 55 FPS on LR-SOD and HR-SOD datasets.
Effectively captures multi-scale features with ResASPP$^{2}$ modules.
Outperforms existing methods on multiple benchmark datasets.
Abstract
In this paper, we introduce Divide-and-Conquer into the salient object detection (SOD) task to enable the model to learn prior knowledge that is for predicting the saliency map. We design a novel network, Divide-and-Conquer Network (DC-Net) which uses two encoders to solve different subtasks that are conducive to predicting the final saliency map, here is to predict the edge maps with width 4 and location maps of salient objects and then aggregate the feature maps with different semantic information into the decoder to predict the final saliency map. The decoder of DC-Net consists of our newly designed two-level Residual nested-ASPP (ResASPP) modules, which have the ability to capture a large number of different scale features with a small number of convolution operations and have the advantages of maintaining high resolution all the time and being able to obtain a large and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution
