PM-MOE: Mixture of Experts on Private Model Parameters for Personalized   Federated Learning

Yu Feng; Yangli-ao Geng; Yifan Zhu; Zongfu Han; Xie Yu; Kaiwen Xue,; Haoran Luo; Mengyang Sun; Guangwei Zhang; Meina Song

arXiv:2502.00354·cs.LG·February 4, 2025

PM-MOE: Mixture of Experts on Private Model Parameters for Personalized Federated Learning

Yu Feng, Yangli-ao Geng, Yifan Zhu, Zongfu Han, Xie Yu, Kaiwen Xue,, Haoran Luo, Mengyang Sun, Guangwei Zhang, Meina Song

PDF

1 Repo

TL;DR

The paper introduces PM-MoE, a novel architecture for personalized federated learning that leverages a mixture of expert modules and energy-based denoising to improve model personalization across diverse data domains.

Contribution

It proposes the PM-MoE architecture that enhances personalized federated learning by integrating expert modules and denoising, improving performance with minimal additional training.

Findings

01

Significant performance improvements on six datasets.

02

Effective across nine model-split-based algorithms.

03

Validated under two heterogeneity settings.

Abstract

Federated learning (FL) has gained widespread attention for its privacy-preserving and collaborative learning capabilities. Due to significant statistical heterogeneity, traditional FL struggles to generalize a shared model across diverse data domains. Personalized federated learning addresses this issue by dividing the model into a globally shared part and a locally private part, with the local model correcting representation biases introduced by the global model. Nevertheless, locally converged parameters more accurately capture domain-specific knowledge, and current methods overlook the potential benefits of these parameters. To address these limitations, we propose PM-MoE architecture. This architecture integrates a mixture of personalized modules and an energy-based personalized modules denoising, enabling each client to select beneficial personalized parameters from other clients.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dannis97500/pm-moe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Attention Is All You Need