Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection
Chen Zhang, Runmin Cong, Qinwei Lin, Lin Ma, Feng Li, Yao Zhao, Sam, Kwong

TL;DR
This paper introduces CDINet, a novel RGB-D salient object detection network that models cross-modality interactions differently at various feature layers, significantly improving detection accuracy.
Contribution
It proposes a new approach to model the dependence of RGB and depth modalities at different layers, with modules for detail and semantic enhancement, and a dense decoding structure.
Findings
Outperforms 15 state-of-the-art methods on five datasets.
Effectively enhances details and semantics through novel modules.
Achieves superior quantitative and qualitative results.
Abstract
The popularity and promotion of depth maps have brought new vigor and vitality into salient object detection (SOD), and a mass of RGB-D SOD algorithms have been proposed, mainly concentrating on how to better integrate cross-modality features from RGB image and depth map. For the cross-modality interaction in feature encoder, existing methods either indiscriminately treat RGB and depth modalities, or only habitually utilize depth cues as auxiliary information of the RGB branch. Different from them, we reconsider the status of two modalities and propose a novel Cross-modality Discrepant Interaction Network (CDINet) for RGB-D SOD, which differentially models the dependence of two modalities according to the feature representations of different layers. To this end, two components are designed to implement the effective cross-modality interaction: 1) the RGB-induced Detail Enhancement (RDE)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face Recognition and Perception · Advanced Neural Network Applications
