Navigating the landscape of multimodal AI in medicine: a scoping review on technical challenges and clinical applications
Daan Schouten, Giulia Nicoletti, Bas Dille, Catherine Chia, Pierpaolo Vendittelli, Megan Schuurmans, Geert Litjens, Nadieh Khalili

TL;DR
This review explores the development, challenges, and clinical applications of multimodal AI in medicine, highlighting its superior performance over unimodal models and discussing strategies for effective implementation.
Contribution
It provides a comprehensive overview of deep learning-based multimodal AI applications in healthcare, analyzing 432 papers and identifying key challenges and future directions.
Findings
Multimodal AI models outperform unimodal models by an average of 6.2 percentage points in AUC.
Challenges include data heterogeneity and incomplete datasets.
Commercial multimodal AI models are emerging for clinical use.
Abstract
Recent technological advances in healthcare have led to unprecedented growth in patient data quantity and diversity. While artificial intelligence (AI) models have shown promising results in analyzing individual data modalities, there is increasing recognition that models integrating multiple complementary data sources, so-called multimodal AI, could enhance clinical decision-making. This scoping review examines the landscape of deep learning-based multimodal AI applications across the medical domain, analyzing 432 papers published between 2018 and 2024. We provide an extensive overview of multimodal AI development across different medical disciplines, examining various architectural approaches, fusion strategies, and common application areas. Our analysis reveals that multimodal AI models consistently outperform their unimodal counterparts, with an average improvement of 6.2 percentage…
Taxonomy
Topics: Artificial Intelligence in Healthcare and Education
