Explicit Attention-Enhanced Fusion for RGB-Thermal Perception Tasks
Mingjian Liang, Junjie Hu, Chenyu Bao, Hua Feng, Fuqin Deng, Tin, Lun Lam

TL;DR
This paper introduces Explicit Attention-Enhanced Fusion (EAEF), a novel method for effectively combining RGB and thermal data in perception tasks, significantly improving performance across multiple benchmarks.
Contribution
The paper proposes a new fusion strategy that adaptively enhances feature extraction for different data availability scenarios, outperforming existing methods in perception tasks.
Findings
Outperforms state-of-the-art by 1.6% in mIoU on semantic segmentation
Achieves 3.1% improvement in MAE on salient object detection
Increases mAP by 2.3% on object detection
Abstract
Recently, RGB-Thermal based perception has shown significant advances. Thermal information provides useful clues when visual cameras suffer from poor lighting conditions, such as low light and fog. However, how to effectively fuse RGB images and thermal data remains an open challenge. Previous works involve naive fusion strategies such as merging them at the input, concatenating multi-modality features inside models, or applying attention to each data modality. These fusion strategies are straightforward yet insufficient. In this paper, we propose a novel fusion method named Explicit Attention-Enhanced Fusion (EAEF) that fully takes advantage of each type of data. Specifically, we consider the following cases: i) both RGB data and thermal data, ii) only one of the types of data, and iii) none of them generate discriminative features. EAEF uses one branch to enhance feature extraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Video Surveillance and Tracking Methods · Infrared Target Detection Methodologies
MethodsNone · Masked autoencoder
