Residual Spatial Fusion Network for RGB-Thermal Semantic Segmentation
Ping Li, Junjie Chen, Binbin Lin, Xianghua Xu

TL;DR
This paper introduces RSFNet, a novel asymmetric encoder-decoder architecture with a residual spatial fusion module for RGB-T semantic segmentation, effectively handling modality differences and lighting variations.
Contribution
The work proposes an asymmetric encoder and a residual spatial fusion module with structural re-parameterization for improved RGB-T segmentation.
Findings
Achieves state-of-the-art performance on MFNet and PST900 benchmarks.
Effectively fuses RGB and thermal features with adaptive spatial weights.
Balances segmentation accuracy and computational speed.
Abstract
Semantic segmentation plays an important role in widespread applications such as autonomous driving and robotic sensing. Traditional methods mostly use RGB images which are heavily affected by lighting conditions, \eg, darkness. Recent studies show thermal images are robust to the night scenario as a compensating modality for segmentation. However, existing works either simply fuse RGB-Thermal (RGB-T) images or adopt the encoder with the same structure for both the RGB stream and the thermal stream, which neglects the modality difference in segmentation under varying lighting conditions. Therefore, this work proposes a Residual Spatial Fusion Network (RSFNet) for RGB-T semantic segmentation. Specifically, we employ an asymmetric encoder to learn the compensating features of the RGB and the thermal images. To effectively fuse the dual-modality features, we generate the pseudo-labels by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image Fusion Techniques
MethodsHierarchical Feature Fusion · Residual Connection
