FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation
Fuqin Deng, Hua Feng, Mingjian Liang, Hongmin Wang, Yong Yang, Yuan, Gao, Junfeng Chen, Junjie Hu, Xiyue Guo, and Tin Lun Lam

TL;DR
FEANet is a two-stage neural network that enhances feature extraction using attention modules to improve real-time RGB-Thermal semantic segmentation, outperforming state-of-the-art methods in accuracy and speed.
Contribution
The paper introduces a novel Feature-Enhanced Attention Module (FEAM) that improves spatial detail preservation and attention focus in RGB-T segmentation.
Findings
Outperforms SOTA methods by +2.6% in global mAcc and +0.8% in global mIoU.
Achieves real-time inference speed on NVIDIA GeForce RTX 2080 Ti.
Effectively preserves spatial information and enhances high-resolution feature focus.
Abstract
The RGB-Thermal (RGB-T) information for semantic segmentation has been extensively explored in recent years. However, most existing RGB-T semantic segmentation usually compromises spatial resolution to achieve real-time inference speed, which leads to poor performance. To better extract detail spatial information, we propose a two-stage Feature-Enhanced Attention Network (FEANet) for the RGB-T semantic segmentation task. Specifically, we introduce a Feature-Enhanced Attention Module (FEAM) to excavate and enhance multi-level features from both the channel and spatial views. Benefited from the proposed FEAM module, our FEANet can preserve the spatial information and shift more attention to high-resolution features from the fused RGB-T images. Extensive experiments on the urban scene dataset demonstrate that our FEANet outperforms other state-of-the-art (SOTA) RGB-T methods in terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Video Surveillance and Tracking Methods
MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
