Towards Accurate RGB-D Saliency Detection with Complementary Attention and Adaptive Integration
Hong-Bo Bi, Zi-Qi Liu, Kang Wang, Bo Dong, Geng Chen, Ji-Quan Ma

TL;DR
This paper introduces CAAI-Net, a novel RGB-D saliency detection model that employs complementary attention and adaptive feature fusion, significantly improving accuracy by effectively integrating RGB and depth information.
Contribution
The paper proposes a unified framework with novel CCA and AFI modules for enhanced RGB-D saliency detection, addressing depth map quality issues and improving feature integration.
Findings
Outperforms nine state-of-the-art models on six benchmark datasets
Effective suppression of background disturbances in salient object detection
Demonstrates the effectiveness of proposed CCA and AFI modules through ablation studies
Abstract
Saliency detection based on the complementary information from RGB images and depth maps has recently gained great popularity. In this paper, we propose Complementary Attention and Adaptive Integration Network (CAAI-Net), a novel RGB-D saliency detection model that integrates complementary attention based feature concentration and adaptive cross-modal feature fusion into a unified framework for accurate saliency detection. Specifically, we propose a context-aware complementary attention (CCA) module, which consists of a feature interaction component, a complementary attention component, and a global-context component. The CCA module first utilizes the feature interaction component to extract rich local context features. The resulting features are then fed into the complementary attention component, which employs the complementary attention generated from adjacent levels to guide the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Image and Video Quality Assessment · Advanced Neural Network Applications
