Weakly-Supervised Multimodal Learning on MIMIC-CXR

Andrea Agostini; Daphn\'e Chopard; Yang Meng; Norbert Fortin; Babak; Shahbaba; Stephan Mandt; Thomas M. Sutter; Julia E. Vogt

arXiv:2411.10356·cs.LG·November 18, 2024

Weakly-Supervised Multimodal Learning on MIMIC-CXR

Andrea Agostini, Daphn\'e Chopard, Yang Meng, Norbert Fortin, Babak, Shahbaba, Stephan Mandt, Thomas M. Sutter, Julia E. Vogt

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the Multimodal Variational Mixture-of-Experts VAE on MIMIC-CXR, showing it outperforms other models and fully supervised methods, addressing challenges in multimodal medical data integration and label scarcity.

Contribution

It introduces and thoroughly evaluates the MMVM VAE, demonstrating its superior performance in multimodal medical imaging tasks.

Findings

01

MMVM VAE outperforms other multimodal VAEs

02

MMVM VAE surpasses fully supervised approaches

03

Demonstrates potential for real-world medical applications

Abstract

Multimodal data integration and label scarcity pose significant challenges for machine learning in medical settings. To address these issues, we conduct an in-depth evaluation of the newly proposed Multimodal Variational Mixture-of-Experts (MMVM) VAE on the challenging MIMIC-CXR dataset. Our analysis demonstrates that the MMVM VAE consistently outperforms other multimodal VAEs and fully supervised approaches, highlighting its strong potential for real-world medical applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

agostini335/mmvmvae-mimic
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Computational Techniques and Applications