Multimodal, Multi-Disease Medical Imaging Foundation Model (MerMED-FM)
Yang Zhou, Chrystie Wan Ning Quek, Jun Zhou, Yan Wang, Yang Bai, Yuhe Ke, Jie Yao, Laura Gutierrez, Zhen Ling Teo, Darren Shu Jeng Ting, Brian T. Soetikno, Christopher S. Nielsen, Tobias Elze, Zengxiang Li, Linh Le Dinh, Lionel Tim-Ee Cheng, Tran Nguyen Tuan Anh

TL;DR
MerMED-FM is a comprehensive multimodal, multi-disease medical imaging foundation model trained on 3.3 million images, demonstrating high accuracy across various modalities and specialties, advancing AI's clinical utility in medicine.
Contribution
This work introduces MerMED-FM, a novel self-supervised, multimodal, multi-specialty foundation model trained on diverse datasets, outperforming existing models in medical imaging interpretation.
Findings
Achieved high AUROCs across multiple modalities, e.g., 0.988 for OCT and 0.982 for pathology.
Trained on 3.3 million images from over ten specialties and seven modalities.
Demonstrated strong performance across diverse diseases and imaging types.
Abstract
Current artificial intelligence models for medical imaging are predominantly single modality and single disease. Attempts to create multimodal and multi-disease models have resulted in inconsistent clinical accuracy. Furthermore, training these models typically requires large, labour-intensive, well-labelled datasets. We developed MerMED-FM, a state-of-the-art multimodal, multi-specialty foundation model trained using self-supervised learning and a memory module. MerMED-FM was trained on 3.3 million medical images from over ten specialties and seven modalities, including computed tomography (CT), chest X-rays (CXR), ultrasound (US), pathology patches, color fundus photography (CFP), optical coherence tomography (OCT) and dermatology images. MerMED-FM was evaluated across multiple diseases and compared against existing foundational models. Strong performance was achieved across all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · COVID-19 diagnosis using AI · Cutaneous Melanoma Detection and Management
