ACNet: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation
Xinxin Hu, Kailun Yang, Lei Fei, Kaiwei Wang

TL;DR
ACNet introduces an attention-based architecture with a novel module to effectively fuse RGB and depth features, significantly improving semantic segmentation performance on RGBD datasets.
Contribution
The paper proposes the Attention Complementary Module and a multi-branch architecture to better exploit RGBD features for segmentation.
Findings
Achieves 48.3% mIoU on NYUDv2 with ResNet50.
Outperforms state-of-the-art RGBD segmentation methods.
Demonstrates effective feature fusion using attention mechanisms.
Abstract
Compared to RGB semantic segmentation, RGBD semantic segmentation can achieve better performance by taking depth information into consideration. However, it is still problematic for contemporary segmenters to effectively exploit RGBD information since the feature distributions of RGB and depth (D) images vary significantly in different scenes. In this paper, we propose an Attention Complementary Network (ACNet) that selectively gathers features from RGB and depth branches. The main contributions lie in the Attention Complementary Module (ACM) and the architecture with three parallel branches. More precisely, ACM is a channel attention-based module that extracts weighted features from RGB and depth branches. The architecture preserves the inference of the original RGB and depth branches, and enables the fusion branch at the same time. Based on the above structures, ACNet is capable of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Medical Image Segmentation Techniques
