ACFNet: Adaptively-Cooperative Fusion Network for RGB-D Salient Object Detection
Jinchao Zhu

TL;DR
ACFNet is a novel RGB-D salient object detection network that adaptively combines early and late fusion strategies, using semantic guidance and attention modules to improve accuracy and robustness across diverse scenarios.
Contribution
The paper introduces ACFNet with ResinRes structure, an adaptively-cooperative fusion scheme, and a type-based attention module, advancing multi-stage feature fusion and semantic guidance in RGB-D saliency detection.
Findings
Outperforms 18 state-of-the-art algorithms on RGB-D SOD datasets.
Effectively balances early and late fusion advantages for diverse object scenarios.
Enhances multi-scale perception and feature transfer through novel modules.
Abstract
The reasonable employment of RGB and depth data show great significance in promoting the development of computer vision tasks and robot-environment interaction. However, there are different advantages and disadvantages in the early and late fusion of the two types of data. Besides, due to the diversity of object information, using a single type of data in a specific scenario tends to result in semantic misleading. Based on the above considerations, we propose an adaptively-cooperative fusion network (ACFNet) with ResinRes structure for salient object detection. This structure is designed to flexibly utilize the advantages of feature fusion in early and late stages. Secondly, an adaptively-cooperative semantic guidance (ACG) scheme is designed to suppress inaccurate features in the guidance phase. Further, we proposed a type-based attention module (TAM) to optimize the network and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsTemporal Adaptive Module · Convolution
