CFCML: A Coarse-to-Fine Crossmodal Learning Framework For Disease Diagnosis Using Multimodal Images and Tabular Data
Tianling Liu, Hongying Liu, Fanhua Shang, Lequan Yu, Tong Han, Liang Wan

TL;DR
This paper introduces CFCML, a novel framework that progressively reduces the modality gap between multimodal images and tabular data for improved disease diagnosis accuracy, leveraging hierarchical prototype-based contrastive learning.
Contribution
The paper proposes a coarse-to-fine crossmodal learning framework that explores local and high-level relationships, incorporating hierarchical prototype-based contrastive learning to enhance diagnostic performance.
Findings
Outperforms state-of-the-art methods on MEN and Derm7pt datasets.
Achieves 1.53% and 0.91% improvements in AUC metrics.
Effectively reduces modality gap and enhances crossmodal feature discrimination.
Abstract
In clinical practice, crossmodal information including medical images and tabular data is essential for disease diagnosis. There exists a significant modality gap between these data types, which obstructs advancements in crossmodal diagnostic accuracy. Most existing crossmodal learning (CML) methods primarily focus on exploring relationships among high-level encoder outputs, leading to the neglect of local information in images. Additionally, these methods often overlook the extraction of task-relevant information. In this paper, we propose a novel coarse-to-fine crossmodal learning (CFCML) framework to progressively reduce the modality gap between multimodal images and tabular data, by thoroughly exploring inter-modal relationships. At the coarse stage, we explore the relationships between multi-granularity features from various image encoder stages and tabular information,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Machine Learning in Healthcare · Domain Adaptation and Few-Shot Learning
