Deep Unfolding Multi-modal Image Fusion Network via Attribution Analysis
Haowen Bai, Zixiang Zhao, Jiangshe Zhang, Baisong Jiang, Lilun Deng,, Yukun Cui, Shuang Xu, Chunxia Zhang

TL;DR
This paper introduces UAAFusion, a novel deep unfolding network that uses attribution analysis to enhance multi-modal image fusion specifically for semantic segmentation, improving the interaction between fusion and segmentation tasks.
Contribution
We propose a model-driven unfolding network that incorporates attribution analysis and attention mechanisms to optimize fused images for semantic segmentation, with a new pathway function and memory augmentation.
Findings
Outperforms existing methods in image fusion quality
Enhances segmentation accuracy through attribution-guided fusion
Demonstrates robustness across multiple datasets
Abstract
Multi-modal image fusion synthesizes information from multiple sources into a single image, facilitating downstream tasks such as semantic segmentation. Current approaches primarily focus on acquiring informative fusion images at the visual display stratum through intricate mappings. Although some approaches attempt to jointly optimize image fusion and downstream tasks, these efforts often lack direct guidance or interaction, serving only to assist with a predefined fusion loss. To address this, we propose an ``Unfolding Attribution Analysis Fusion network'' (UAAFusion), using attribution analysis to tailor fused images more effectively for semantic segmentation, enhancing the interaction between the fusion and segmentation. Specifically, we utilize attribution analysis techniques to explore the contributions of semantic regions in the source images to task discrimination. At the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
