SMFusion: Semantic-Preserving Fusion of Multimodal Medical Images for Enhanced Clinical Diagnosis
Haozhe Xiang, Han Zhang, Yu Cheng, Xiongwen Quan, Wanwan Huang

TL;DR
This paper introduces a novel semantic-guided multimodal medical image fusion method that incorporates medical prior knowledge and textual descriptions to improve diagnostic information retention and image quality.
Contribution
It presents the first approach to integrate medical prior knowledge and text descriptions into image fusion, enhancing semantic preservation and diagnostic utility.
Findings
Achieves superior qualitative and quantitative fusion performance.
Preserves more critical medical information in fused images.
Effectively aligns textual and visual features using semantic interaction.
Abstract
Multimodal medical image fusion plays a crucial role in medical diagnosis by integrating complementary information from different modalities to enhance image readability and clinical applicability. However, existing methods mainly follow computer vision standards for feature extraction and fusion strategy formulation, overlooking the rich semantic information inherent in medical images. To address this limitation, we propose a novel semantic-guided medical image fusion approach that, for the first time, incorporates medical prior knowledge into the fusion process. Specifically, we construct a publicly available multimodal medical image-text dataset, upon which text descriptions generated by BiomedGPT are encoded and semantically aligned with image features in a high-dimensional space via a semantic interaction alignment module. During this process, a cross attention based linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Image Enhancement Techniques · Generative Adversarial Networks and Image Synthesis
MethodsSoftmax · Attention Is All You Need
