Resource-Efficient Federated Multimodal Learning via Layer-wise and Progressive Training
Ye Lin Tun, Chu Myaet Thwal, Minh N. H. Nguyen, Choong Seon Hong

TL;DR
This paper introduces LW-FedMML, a resource-efficient federated multimodal learning method that reduces computational, memory, and communication costs by training models in stages, enabling effective multimodal learning with limited client resources.
Contribution
It proposes a layer-wise, staged training approach for federated multimodal learning, significantly lowering resource requirements compared to traditional end-to-end methods.
Findings
LW-FedMML reduces memory usage by up to 2.7 times.
It cuts computational FLOPs by 2.4 times.
Communication cost decreases by 2.3 times.
Abstract
Combining different data modalities enables deep neural networks to tackle complex tasks more effectively, making multimodal learning increasingly popular. To harness multimodal data closer to end users, it is essential to integrate multimodal learning with privacy-preserving approaches like federated learning (FL). However, compared to conventional unimodal learning, multimodal setting requires dedicated encoders for each modality, resulting in larger and more complex models. Training these models requires significant resources, presenting a substantial challenge for FL clients operating with limited computation and communication resources. To address these challenges, we introduce LW-FedMML, a layer-wise federated multimodal learning approach which decomposes the training process into multiple stages. Each stage focuses on training only a portion of the model, thereby significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Indoor and Outdoor Localization Technologies
