FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time   Semantic Segmentation

Fuqin Deng; Hua Feng; Mingjian Liang; Hongmin Wang; Yong Yang; Yuan; Gao; Junfeng Chen; Junjie Hu; Xiyue Guo; and Tin Lun Lam

arXiv:2110.08988·cs.CV·October 19, 2021·6 cites

FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation

Fuqin Deng, Hua Feng, Mingjian Liang, Hongmin Wang, Yong Yang, Yuan, Gao, Junfeng Chen, Junjie Hu, Xiyue Guo, and Tin Lun Lam

PDF

Open Access 1 Repo

TL;DR

FEANet is a two-stage neural network that enhances feature extraction using attention modules to improve real-time RGB-Thermal semantic segmentation, outperforming state-of-the-art methods in accuracy and speed.

Contribution

The paper introduces a novel Feature-Enhanced Attention Module (FEAM) that improves spatial detail preservation and attention focus in RGB-T segmentation.

Findings

01

Outperforms SOTA methods by +2.6% in global mAcc and +0.8% in global mIoU.

02

Achieves real-time inference speed on NVIDIA GeForce RTX 2080 Ti.

03

Effectively preserves spatial information and enhances high-resolution feature focus.

Abstract

The RGB-Thermal (RGB-T) information for semantic segmentation has been extensively explored in recent years. However, most existing RGB-T semantic segmentation usually compromises spatial resolution to achieve real-time inference speed, which leads to poor performance. To better extract detail spatial information, we propose a two-stage Feature-Enhanced Attention Network (FEANet) for the RGB-T semantic segmentation task. Specifically, we introduce a Feature-Enhanced Attention Module (FEAM) to excavate and enhance multi-level features from both the channel and spatial views. Benefited from the proposed FEAM module, our FEANet can preserve the spatial information and shift more attention to high-resolution features from the fused RGB-T images. Extensive experiments on the urban scene dataset demonstrate that our FEANet outperforms other state-of-the-art (SOTA) RGB-T methods in terms of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

matrixgame2018/FEANet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Video Surveillance and Tracking Methods

MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings