TL;DR
SFDFusion introduces a novel neural network that combines spatial and frequency domain information for infrared and visible image fusion, achieving high efficiency and superior visual quality suitable for real-time applications.
Contribution
The paper proposes a dual-modality refinement module and a frequency domain fusion module, integrating frequency domain information into image fusion for the first time with an emphasis on efficiency.
Findings
Outperforms existing methods in fusion quality metrics
Demonstrates high efficiency suitable for real-time applications
Enhances downstream detection performance
Abstract
Infrared and visible image fusion aims to utilize the complementary information from two modalities to generate fused images with prominent targets and rich texture details. Most existing algorithms only perform pixel-level or feature-level fusion from different modalities in the spatial domain. They usually overlook the information in the frequency domain, and some of them suffer from inefficiency due to excessively complex structures. To tackle these challenges, this paper proposes an efficient Spatial-Frequency Domain Fusion (SFDFusion) network for infrared and visible image fusion. First, we propose a Dual-Modality Refinement Module (DMRM) to extract complementary information. This module extracts useful information from both the infrared and visible modalities in the spatial domain and enhances fine-grained spatial details. Next, to introduce frequency domain information, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
