Weakly-Supervised Multimodal Learning on MIMIC-CXR
Andrea Agostini, Daphn\'e Chopard, Yang Meng, Norbert Fortin, Babak, Shahbaba, Stephan Mandt, Thomas M. Sutter, Julia E. Vogt

TL;DR
This paper evaluates the Multimodal Variational Mixture-of-Experts VAE on MIMIC-CXR, showing it outperforms other models and fully supervised methods, addressing challenges in multimodal medical data integration and label scarcity.
Contribution
It introduces and thoroughly evaluates the MMVM VAE, demonstrating its superior performance in multimodal medical imaging tasks.
Findings
MMVM VAE outperforms other multimodal VAEs
MMVM VAE surpasses fully supervised approaches
Demonstrates potential for real-world medical applications
Abstract
Multimodal data integration and label scarcity pose significant challenges for machine learning in medical settings. To address these issues, we conduct an in-depth evaluation of the newly proposed Multimodal Variational Mixture-of-Experts (MMVM) VAE on the challenging MIMIC-CXR dataset. Our analysis demonstrates that the MMVM VAE consistently outperforms other multimodal VAEs and fully supervised approaches, highlighting its strong potential for real-world medical applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Computational Techniques and Applications
