Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models
Jun Luo, Chen Chen, Shandong Wu

TL;DR
This paper introduces pFedMoAP, a federated learning framework that personalizes vision-language prompts using a mixture of experts approach, improving alignment and performance across multiple datasets.
Contribution
It proposes a novel personalized federated prompt learning method employing multiple pre-aggregated prompts as experts, enhancing adaptation for lightweight prompts.
Findings
Effective across 9 datasets in various federated settings.
Improves prompt personalization and alignment.
Demonstrates superior performance over existing methods.
Abstract
Federated prompt learning benefits federated learning with CLIP-like Vision-Language Model's (VLM's) robust representation learning ability through prompt learning. However, current federated prompt learning methods are habitually restricted to the traditional FL paradigm, where the participating clients are generally only allowed to download a single globally aggregated model from the server. While justifiable for training full-sized models under federated settings, in this work, we argue that this paradigm is ill-suited for lightweight prompts. By facilitating the clients to download multiple pre-aggregated prompts as fixed non-local experts, we propose Personalized Federated Mixture of Adaptive Prompts (pFedMoAP), a novel FL framework that personalizes the prompt learning process through the lens of Mixture of Experts (MoE). pFedMoAP implements a local attention-based gating network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks
MethodsContrastive Language-Image Pre-training
