TL;DR
This paper introduces EDN, a novel salient object detection network that uses extreme downsampling to capture global context and a specialized decoder for detail recovery, achieving state-of-the-art results in real-time.
Contribution
The paper proposes a new EDN architecture with extreme downsampling and a scale-correlated pyramid convolution for improved salient object detection.
Findings
EDN achieves state-of-the-art performance on SOD benchmarks.
EDN-Lite runs at 316fps with competitive accuracy.
Enhancing high-level features is crucial for effective SOD.
Abstract
Recent progress on salient object detection (SOD) mainly benefits from multi-scale learning, where the high-level and low-level features collaborate in locating salient objects and discovering fine details, respectively. However, most efforts are devoted to low-level feature learning by fusing multi-scale features or enhancing boundary representations. High-level features, which although have long proven effective for many other tasks, yet have been barely studied for SOD. In this paper, we tap into this gap and show that enhancing high- level features is essential for SOD as well. To this end, we introduce an Extremely-Downsampled Network (EDN), which employs an extreme downsampling technique to effectively learn a global view of the whole image, leading to accurate salient object localization. To accomplish better multi-level feature fusion, we construct the Scale-Correlated Pyramid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
