Data-Level Recombination and Lightweight Fusion Scheme for RGB-D Salient   Object Detection

Xuehao Wang; Shuai Li; Chenglizhao Chen; Yuming Fang; Aimin Hao; Hong; Qin

arXiv:2009.05102·cs.CV·December 30, 2020

Data-Level Recombination and Lightweight Fusion Scheme for RGB-D Salient Object Detection

Xuehao Wang, Shuai Li, Chenglizhao Chen, Yuming Fang, Aimin Hao, Hong, Qin

PDF

1 Repo

TL;DR

This paper introduces a novel data-level recombination and lightweight triple-stream fusion approach for RGB-D salient object detection, outperforming existing bi-stream methods by optimally combining RGB and depth data before feature extraction.

Contribution

It proposes a new data recombination strategy and a lightweight triple-stream network to enhance RGB-D fusion, surpassing state-of-the-art bi-stream architectures.

Findings

01

Achieved new state-of-the-art performance on RGB-D salient object detection benchmarks.

02

Demonstrated improved fusion effectiveness through data-level recombination.

03

Reduced model complexity with a lightweight triple-stream design.

Abstract

Existing RGB-D salient object detection methods treat depth information as an independent component to complement its RGB part, and widely follow the bi-stream parallel network architecture. To selectively fuse the CNNs features extracted from both RGB and depth as a final result, the state-of-the-art (SOTA) bi-stream networks usually consist of two independent subbranches; i.e., one subbranch is used for RGB saliency and the other aims for depth saliency. However, its depth saliency is persistently inferior to the RGB saliency because the RGB component is intrinsically more informative than the depth component. The bi-stream architecture easily biases its subsequent fusion procedure to the RGB subbranch, leading to a performance bottleneck. In this paper, we propose a novel data-level recombination strategy to fuse RGB with D (depth) before deep feature extraction, where we cyclically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

XueHaoWang-Beijing/DRLF
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.