Robust RGB-D Fusion for Saliency Detection
Zongwei Wu, Shriarulmozhivarman Gobichettipalayam, Brahim Tamadazte,, Guillaume Allibert, Danda Pani Paudel, C\'edric Demonceaux

TL;DR
This paper introduces a robust RGB-D fusion approach for saliency detection that effectively handles low-quality depth data by using layer-wise and trident spatial attention mechanisms, improving performance over existing methods.
Contribution
The paper presents a novel RGB-D fusion method with layer-wise and trident spatial attention mechanisms that enhance robustness to low-quality depth data in saliency detection.
Findings
Outperforms state-of-the-art fusion methods on five benchmark datasets.
Effectively handles noisy and misaligned depth data.
Improves saliency detection accuracy with robust multi-modal fusion.
Abstract
Efficiently exploiting multi-modal inputs for accurate RGB-D saliency detection is a topic of high interest. Most existing works leverage cross-modal interactions to fuse the two streams of RGB-D for intermediate features' enhancement. In this process, a practical aspect of the low quality of the available depths has not been fully considered yet. In this work, we aim for RGB-D saliency detection that is robust to the low-quality depths which primarily appear in two forms: inaccuracy due to noise and the misalignment to RGB. To this end, we propose a robust RGB-D fusion method that benefits from (1) layer-wise, and (2) trident spatial, attention mechanisms. On the one hand, layer-wise attention (LWA) learns the trade-off between early and late fusion of RGB and depth features, depending upon the depth accuracy. On the other hand, trident spatial attention (TSA) aggregates the features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection
