Leveraging Foundation Models for Multi-modal Federated Learning with   Incomplete Modality

Liwei Che; Jiaqi Wang; Xinyue Liu; Fenglong Ma

arXiv:2406.11048·cs.LG·June 18, 2024

Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality

Liwei Che, Jiaqi Wang, Xinyue Liu, Fenglong Ma

PDF

Open Access

TL;DR

This paper introduces FedMVP, a federated learning approach that leverages large-scale pre-trained models to handle incomplete modalities across clients, improving robustness and performance in multi-modal data scenarios.

Contribution

The paper proposes FedMVP, a novel multi-modal federated learning framework that uses pre-trained models for modality completion and a graph-based aggregation method.

Findings

01

FedMVP outperforms existing methods on image-text classification datasets.

02

The approach is robust to missing modalities during training and inference.

03

Pre-trained models enhance local training efficiency and model robustness.

Abstract

Federated learning (FL) has obtained tremendous progress in providing collaborative training solutions for distributed data silos with privacy guarantees. However, few existing works explore a more realistic scenario where the clients hold multiple data modalities. In this paper, we aim to solve a novel challenge in multi-modal federated learning (MFL) -- modality missing -- the clients may lose part of the modalities in their local data sets. To tackle the problems, we propose a novel multi-modal federated learning method, Federated Multi-modal contrastiVe training with Pre-trained completion (FedMVP), which integrates the large-scale pre-trained models to enhance the federated training. In the proposed FedMVP framework, each client deploys a large-scale pre-trained model with frozen parameters for modality completion and representation knowledge transfer, enabling efficient and robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Topic Modeling · Advanced Graph Neural Networks