SMFusion: Semantic-Preserving Fusion of Multimodal Medical Images for Enhanced Clinical Diagnosis

Haozhe Xiang; Han Zhang; Yu Cheng; Xiongwen Quan; Wanwan Huang

arXiv:2505.12251·cs.CV·May 20, 2025

SMFusion: Semantic-Preserving Fusion of Multimodal Medical Images for Enhanced Clinical Diagnosis

Haozhe Xiang, Han Zhang, Yu Cheng, Xiongwen Quan, Wanwan Huang

PDF

Open Access

TL;DR

This paper introduces a novel semantic-guided multimodal medical image fusion method that incorporates medical prior knowledge and textual descriptions to improve diagnostic information retention and image quality.

Contribution

It presents the first approach to integrate medical prior knowledge and text descriptions into image fusion, enhancing semantic preservation and diagnostic utility.

Findings

01

Achieves superior qualitative and quantitative fusion performance.

02

Preserves more critical medical information in fused images.

03

Effectively aligns textual and visual features using semantic interaction.

Abstract

Multimodal medical image fusion plays a crucial role in medical diagnosis by integrating complementary information from different modalities to enhance image readability and clinical applicability. However, existing methods mainly follow computer vision standards for feature extraction and fusion strategy formulation, overlooking the rich semantic information inherent in medical images. To address this limitation, we propose a novel semantic-guided medical image fusion approach that, for the first time, incorporates medical prior knowledge into the fusion process. Specifically, we construct a publicly available multimodal medical image-text dataset, upon which text descriptions generated by BiomedGPT are encoded and semantically aligned with image features in a high-dimensional space via a semantic interaction alignment module. During this process, a cross attention based linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Image Enhancement Techniques · Generative Adversarial Networks and Image Synthesis

MethodsSoftmax · Attention Is All You Need