Medical Image Segmentation Based on Multi-Modal Convolutional Neural Network: Study on Image Fusion Schemes
Zhe Guo, Xiang Li, Heng Huang, Ning Guo, Quanzheng Li

TL;DR
This paper proposes a unified CNN-based framework for multi-modal medical image segmentation, exploring different fusion schemes and demonstrating that feature-level fusion generally yields the best accuracy and efficiency.
Contribution
It introduces a conceptual architecture for multi-modal image fusion in biomedical analysis and implements three fusion schemes within a CNN framework for improved segmentation.
Findings
Feature-level fusion achieves the best accuracy and efficiency.
All fusion schemes outperform single-modality approaches.
Feature-level fusion is less robust to large modality errors.
Abstract
Image analysis using more than one modality (i.e. multi-modal) has been increasingly applied in the field of biomedical imaging. One of the challenges in performing the multimodal analysis is that there exist multiple schemes for fusing the information from different modalities, where such schemes are application-dependent and lack a unified framework to guide their designs. In this work we firstly propose a conceptual architecture for the image fusion schemes in supervised biomedical image analysis: fusing at the feature level, fusing at the classifier level, and fusing at the decision-making level. Further, motivated by the recent success in applying deep learning for natural image analysis, we implement the three image fusion schemes above based on the Convolutional Neural Network (CNN) with varied structures, and combined into a single framework. The proposed image segmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
