Multi-interactive Feature Learning and a Full-time Multi-modality   Benchmark for Image Fusion and Segmentation

Jinyuan Liu; Zhu Liu; Guanyao Wu; Long Ma; Risheng Liu; Wei Zhong,; Zhongxuan Luo; Xin Fan

arXiv:2308.02097·cs.CV·August 7, 2023·6 cites

Multi-interactive Feature Learning and a Full-time Multi-modality Benchmark for Image Fusion and Segmentation

Jinyuan Liu, Zhu Liu, Guanyao Wu, Long Ma, Risheng Liu, Wei Zhong,, Zhongxuan Luo, Xin Fan

PDF

Open Access 2 Repos

TL;DR

This paper introduces SegMiF, a multi-interactive feature learning architecture that jointly improves image fusion and segmentation performance using a cascade structure, hierarchical attention, and dynamic task weighting, validated on a new multi-modality benchmark.

Contribution

The paper proposes a novel multi-interactive architecture for simultaneous image fusion and segmentation, incorporating hierarchical attention and dynamic weighting, and provides a comprehensive multi-modality benchmark dataset.

Findings

01

Achieves 7.66% higher segmentation mIoU than state-of-the-art methods.

02

Produces visually appealing fused images.

03

Demonstrates effectiveness on public datasets and a new benchmark.

Abstract

Multi-modality image fusion and segmentation play a vital role in autonomous driving and robotic operation. Early efforts focus on boosting the performance for only one task, \emph{e.g.,} fusion or segmentation, making it hard to reach~`Best of Both Worlds'. To overcome this issue, in this paper, we propose a \textbf{M}ulti-\textbf{i}nteractive \textbf{F}eature learning architecture for image fusion and \textbf{Seg}mentation, namely SegMiF, and exploit dual-task correlation to promote the performance of both tasks. The SegMiF is of a cascade structure, containing a fusion sub-network and a commonly used segmentation sub-network. By slickly bridging intermediate features between two components, the knowledge learned from the segmentation task can effectively assist the fusion task. Also, the benefited fusion network supports the segmentation one to perform more pretentiously. Besides, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Visual Attention and Saliency Detection · Photoacoustic and Ultrasonic Imaging

MethodsFocus