Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis

Haowen Bai; Zixiang Zhao; Jiangshe Zhang; Baisong Jiang; Lilun Deng,; Yukun Cui; Shuang Xu; Chunxia Zhang

arXiv:2502.01467·cs.CV·February 4, 2025

Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis

Haowen Bai, Zixiang Zhao, Jiangshe Zhang, Baisong Jiang, Lilun Deng,, Yukun Cui, Shuang Xu, Chunxia Zhang

PDF

TL;DR

This paper introduces UAAFusion, a novel deep unfolding network that uses attribution analysis to enhance multi-modal image fusion specifically for semantic segmentation, improving the interaction between fusion and segmentation tasks.

Contribution

We propose a model-driven unfolding network that incorporates attribution analysis and attention mechanisms to optimize fused images for semantic segmentation, with a new pathway function and memory augmentation.

Findings

01

Outperforms existing methods in image fusion quality

02

Enhances segmentation accuracy through attribution-guided fusion

03

Demonstrates robustness across multiple datasets

Abstract

Multi-modal image fusion synthesizes information from multiple sources into a single image, facilitating downstream tasks such as semantic segmentation. Current approaches primarily focus on acquiring informative fusion images at the visual display stratum through intricate mappings. Although some approaches attempt to jointly optimize image fusion and downstream tasks, these efforts often lack direct guidance or interaction, serving only to assist with a predefined fusion loss. To address this, we propose an ``Unfolding Attribution Analysis Fusion network'' (UAAFusion), using attribution analysis to tailor fused images more effectively for semantic segmentation, enhancing the interaction between the fusion and segmentation. Specifically, we utilize attribution analysis techniques to explore the contributions of semantic regions in the source images to task discrimination. At the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.