MultiMed: Massively Multimodal and Multitask Medical Understanding
Shentong Mo, Paul Pu Liang

TL;DR
MultiMed is a comprehensive benchmark dataset designed to advance large-scale multimodal and multitask AI models for biomedical understanding, covering diverse data types and medical tasks to improve diagnosis, prognosis, and research.
Contribution
The paper introduces MultiMed, a large-scale, multimodal, and multitask biomedical dataset and benchmark, enabling comprehensive evaluation and development of AI models across diverse medical modalities and tasks.
Findings
Training across multiple modalities improves model robustness.
Multimodal models outperform unimodal counterparts on key tasks.
Large-scale multitask learning enhances generalization in biomedical AI.
Abstract
Biomedical data is inherently multimodal, consisting of electronic health records, medical imaging, digital pathology, genome sequencing, wearable sensors, and more. The application of artificial intelligence tools to these multifaceted sensing technologies has the potential to revolutionize the prognosis, diagnosis, and management of human health and disease. However, current approaches to biomedical AI typically only train and evaluate with one or a small set of medical modalities and tasks. This limitation hampers the development of comprehensive tools that can leverage the rich interconnected information across many heterogeneous biomedical sensors. To address this challenge, we present MultiMed, a benchmark designed to evaluate and enable large-scale learning across a wide spectrum of medical modalities and tasks. MultiMed consists of 2.56 million samples across ten medical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsSparse Evolutionary Training
