Recursive Multi-model Complementary Deep Fusion forRobust Salient Object   Detection via Parallel Sub Networks

Zhenyu Wu; Shuai Li; Chenglizhao Chen; Aimin Hao; Hong Qin

arXiv:2008.04158·cs.CV·August 11, 2020

Recursive Multi-model Complementary Deep Fusion forRobust Salient Object Detection via Parallel Sub Networks

Zhenyu Wu, Shuai Li, Chenglizhao Chen, Aimin Hao, Hong Qin

PDF

Open Access 1 Repo

TL;DR

This paper introduces a wider, parallel sub-network architecture with dense interactions for salient object detection, achieving superior performance by enhancing feature diversity and complementarity.

Contribution

It proposes a novel wider network with parallel sub-networks and dense short-connections to improve feature diversity and complementarity in salient object detection.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets.

02

Demonstrates strong generalization across different datasets.

03

Shows effective feature fusion improves detection accuracy.

Abstract

Fully convolutional networks have shown outstanding performance in the salient object detection (SOD) field. The state-of-the-art (SOTA) methods have a tendency to become deeper and more complex, which easily homogenize their learned deep features, resulting in a clear performance bottleneck. In sharp contrast to the conventional ``deeper'' schemes, this paper proposes a ``wider'' network architecture which consists of parallel sub networks with totally different network architectures. In this way, those deep features obtained via these two sub networks will exhibit large diversity, which will have large potential to be able to complement with each other. However, a large diversity may easily lead to the feature conflictions, thus we use the dense short-connections to enable a recursively interaction between the parallel sub networks, pursuing an optimal complementary status between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Diamond101010/RMMDF
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications