Multimodal Federated Learning: A Survey through the Lens of Different FL Paradigms
Yuanzhe Peng, Jieming Bian, Lei Wang, Yin Huang, Jie Xu

TL;DR
This survey systematically analyzes Multimodal Federated Learning across different FL paradigms, highlighting unique challenges and providing a taxonomy to guide future research in this emerging field.
Contribution
It offers the first comprehensive taxonomy of MFL through the lens of horizontal, vertical, and hybrid FL paradigms, addressing modality-specific challenges and future directions.
Findings
Identifies modality heterogeneity as a key challenge in MFL.
Highlights privacy and communication issues unique to multimodal data.
Provides a structured overview of algorithms across FL paradigms.
Abstract
Multimodal Federated Learning (MFL) lies at the intersection of two pivotal research areas: leveraging complementary information from multiple modalities to improve downstream inference performance and enabling distributed training to enhance efficiency and preserve privacy. Despite the growing interest in MFL, there is currently no comprehensive taxonomy that organizes MFL through the lens of different Federated Learning (FL) paradigms. This perspective is important because multimodal data introduces distinct challenges across various FL settings. These challenges, including modality heterogeneity, privacy heterogeneity, and communication inefficiency, are fundamentally different from those encountered in traditional unimodal or non-FL scenarios. In this paper, we systematically examine MFL within the context of three major FL paradigms: horizontal FL (HFL), vertical FL (VFL), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning · Human Mobility and Location-Based Analysis
