Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection
Jinchao Zhu, Xiaoyu Zhang, Xian Fang, Feng Dong, Qiu Yu

TL;DR
This paper introduces GRNet, a novel RGB-D salient object detection framework that adaptively balances multi-modal information during feature fusion, improving robustness and performance on multiple benchmarks.
Contribution
The paper proposes a modal-adaptive gating mechanism and a recoding mixer to effectively balance and integrate multi-modal features in RGB-D salient object detection.
Findings
Outperforms 9 state-of-the-art methods on eight benchmarks.
Effectively suppresses invalid modal information during feature fusion.
Demonstrates improved robustness and accuracy in real-world scenarios.
Abstract
The multi-modal salient object detection model based on RGB-D information has better robustness in the real world. However, it remains nontrivial to better adaptively balance effective multi-modal information in the feature fusion phase. In this letter, we propose a novel gated recoding network (GRNet) to evaluate the information validity of the two modes, and balance their influence. Our framework is divided into three phases: perception phase, recoding mixing phase and feature integration phase. First, A perception encoder is adopted to extract multi-level single-modal features, which lays the foundation for multi-modal semantic comparative analysis. Then, a modal-adaptive gate unit (MGU) is proposed to suppress the invalid information and transfer the effective modal features to the recoding mixer and the hybrid branch decoder. The recoding mixer is responsible for recoding and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Multimodal Machine Learning Applications
