Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation
Jinyuan Liu, Zhu Liu, Guanyao Wu, Long Ma, Risheng Liu, Wei Zhong,, Zhongxuan Luo, Xin Fan

TL;DR
This paper introduces SegMiF, a multi-interactive feature learning architecture that jointly improves image fusion and segmentation performance using a cascade structure, hierarchical attention, and dynamic task weighting, validated on a new multi-modality benchmark.
Contribution
The paper proposes a novel multi-interactive architecture for simultaneous image fusion and segmentation, incorporating hierarchical attention and dynamic weighting, and provides a comprehensive multi-modality benchmark dataset.
Findings
Achieves 7.66% higher segmentation mIoU than state-of-the-art methods.
Produces visually appealing fused images.
Demonstrates effectiveness on public datasets and a new benchmark.
Abstract
Multi-modality image fusion and segmentation play a vital role in autonomous driving and robotic operation. Early efforts focus on boosting the performance for only one task, \emph{e.g.,} fusion or segmentation, making it hard to reach~`Best of Both Worlds'. To overcome this issue, in this paper, we propose a \textbf{M}ulti-\textbf{i}nteractive \textbf{F}eature learning architecture for image fusion and \textbf{Seg}mentation, namely SegMiF, and exploit dual-task correlation to promote the performance of both tasks. The SegMiF is of a cascade structure, containing a fusion sub-network and a commonly used segmentation sub-network. By slickly bridging intermediate features between two components, the knowledge learned from the segmentation task can effectively assist the fusion task. Also, the benefited fusion network supports the segmentation one to perform more pretentiously. Besides, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Visual Attention and Saliency Detection · Photoacoustic and Ultrasonic Imaging
MethodsFocus
