Multimodal learning for scalable representation of high-dimensional medical data
Areej Alsaafin, Abubakr Shafique, Saghir Alfasly, Krishna R. Kalari, H. R. Tizhoosh

TL;DR
This paper introduces MarbliX, a new AI framework that combines medical images and genomic data to improve cancer diagnostics and patient similarity analysis.
Contribution
MarbliX is a self-supervised multimodal learning framework that generates compact binary codes for scalable and interpretable medical data integration.
Findings
MarbliX outperforms unimodal approaches in lung cancer with 85%–89% evaluation metrics.
In kidney cancer, real-valued monograms achieve F1: 80%–83% and Accuracy: 87%–90%.
Binary monograms enable efficient retrieval and case-based reasoning in a unified latent space.
Abstract
Integrating artificial intelligence (AI) with healthcare data is rapidly transforming medical diagnostics and driving progress toward precision medicine. However, effectively leveraging multimodal data, particularly digital pathology whole slide images (WSIs) and genomic sequencing, remains a significant challenge due to the intrinsic heterogeneity of these modalities and the need for scalable and interpretable frameworks. Existing diagnostic models typically operate on unimodal data, overlooking critical cross-modal interactions that can yield richer clinical insights. We introduce MarbliX (Multimodal Association and Retrieval with Binary Latent Indexed matriX), a self-supervised framework that learns to embed WSIs and immunogenomic profiles into compact, scalable binary codes, termed “monogram.” By optimizing a triplet contrastive objective across modalities, MarbliX captures…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Digital Imaging for Blood Diseases · Single-cell and spatial transcriptomics
