Spatial-Frequency Enhanced Mamba for Multi-Modal Image Fusion
Hui Sun, Long Lv, Pingping Zhang, Tongdan Tang, Feng Tian, Weibing Sun, Huchuan Lu

TL;DR
This paper introduces SFMFusion, a novel multi-modal image fusion framework that enhances Mamba with spatial and frequency domain features, achieving superior results over existing methods.
Contribution
The paper proposes a new three-branch structure and specialized blocks to improve feature extraction and fusion in MMIF using Mamba, incorporating spatial-frequency enhancement and dynamic fusion.
Findings
Outperforms most state-of-the-art methods on six datasets
Effective integration of spatial and frequency domain features
Demonstrates the advantages of the proposed blocks in MMIF
Abstract
Multi-Modal Image Fusion (MMIF) aims to integrate complementary image information from different modalities to produce informative images. Previous deep learning-based MMIF methods generally adopt Convolutional Neural Networks (CNNs) or Transformers for feature extraction. However, these methods deliver unsatisfactory performances due to the limited receptive field of CNNs and the high computational cost of Transformers. Recently, Mamba has demonstrated a powerful potential for modeling long-range dependencies with linear complexity, providing a promising solution to MMIF. Unfortunately, Mamba lacks full spatial and frequency perceptions, which are very important for MMIF. Moreover, employing Image Reconstruction (IR) as an auxiliary task has been proven beneficial for MMIF. However, a primary challenge is how to leverage IR efficiently and effectively. To address the above issues, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Advanced Image Processing Techniques · Remote-Sensing Image Classification
