CAR-MFL: Cross-Modal Augmentation by Retrieval for Multimodal Federated Learning with Missing Modalities
Pranav Poudel, Prashant Shrestha, Sanskar Amgain, Yash Raj Shrestha,, Prashnna Gyawali, Binod Bhattarai

TL;DR
This paper introduces CAR-MFL, a cross-modal retrieval-based data augmentation method for multimodal federated learning in healthcare, effectively handling missing modalities and enhancing performance while preserving privacy.
Contribution
It proposes a novel cross-modal augmentation technique using retrieval to fill missing modalities in federated learning, improving medical multimodal analysis.
Findings
Outperforms several baseline methods on medical multimodal benchmarks
Effectively handles missing modalities in federated learning scenarios
Ensures privacy preservation during model training
Abstract
Multimodal AI has demonstrated superior performance over unimodal approaches by leveraging diverse data sources for more comprehensive analysis. However, applying this effectiveness in healthcare is challenging due to the limited availability of public datasets. Federated learning presents an exciting solution, allowing the use of extensive databases from hospitals and health centers without centralizing sensitive data, thus maintaining privacy and security. Yet, research in multimodal federated learning, particularly in scenarios with missing modalities a common issue in healthcare datasets remains scarce, highlighting a critical area for future exploration. Toward this, we propose a novel method for multimodal federated learning with missing modalities. Our contribution lies in a novel cross-modal data augmentation by retrieval, leveraging the small publicly available dataset to fill…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
