MMA-UNet: A Multi-Modal Asymmetric UNet Architecture for Infrared and Visible Image Fusion
Jingxue Huang, Xilai Li, Tianshu Tan, Xiaosong Li, Tao Ye

TL;DR
This paper introduces MMA-UNet, an innovative asymmetric neural network architecture designed to improve infrared and visible image fusion by maintaining balanced, deep feature space alignment across modalities, resulting in superior fusion quality.
Contribution
The paper proposes a novel asymmetric UNet architecture with separate encoders and cross-scale fusion for multi-modal image fusion, addressing limitations of symmetric fusion methods.
Findings
MMA-UNet outperforms state-of-the-art fusion methods in quality.
The approach produces visually natural and semantically rich fused images.
Extensive experiments validate the effectiveness of the proposed method.
Abstract
Multi-modal image fusion (MMIF) maps useful information from various modalities into the same representation space, thereby producing an informative fused image. However, the existing fusion algorithms tend to symmetrically fuse the multi-modal images, causing the loss of shallow information or bias towards a single modality in certain regions of the fusion results. In this study, we analyzed the spatial distribution differences of information in different modalities and proved that encoding features within the same network is not conducive to achieving simultaneous deep feature space alignment for multi-modal images. To overcome this issue, a Multi-Modal Asymmetric UNet (MMA-UNet) was proposed. We separately trained specialized feature encoders for different modal and implemented a cross-scale fusion strategy to maintain the features from different modalities within the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Infrared Target Detection Methodologies · Remote-Sensing Image Classification
