A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion
Xiaoli Zhang, Liying Wang, Libo Zhao, Xiongfei Li, Siwei Ma

TL;DR
This paper introduces a novel multi-branch network that models inter-feature relationships and high-level graph reasoning to improve infrared-visible image fusion, enhancing downstream tasks like object detection and segmentation.
Contribution
It proposes a three-branch encoder-decoder architecture with graph reasoning and cross attention modules for better multi-modality feature fusion.
Findings
Outperforms state-of-the-art in image fusion tasks.
Achieves higher mAP and mIoU in object detection and segmentation.
Effectively preserves high-frequency details and modality-shared information.
Abstract
Multi-modality image fusion aims at fusing modality-specific (complementarity) and modality-shared (correlation) information from multiple source images. To tackle the problem of the neglect of inter-feature relationships, high-frequency information loss, and the limited attention to downstream tasks, this paper focuses on how to model correlation-driven decomposing features and reason high-level graph representation by efficiently extracting complementary information and aggregating multi-guided features. We propose a three-branch encoder-decoder architecture along with corresponding fusion layers as the fusion strategy. Firstly, shallow features from individual modalities are extracted by a depthwise convolution layer combined with the transformer block. In the three parallel branches of the encoder, Cross Attention and Invertible Block (CAI) extracts local features and preserves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Remote-Sensing Image Classification · Image Enhancement Techniques
MethodsDepthwise Convolution · Convolution · Balanced Selection
