FedMoE-DA: Federated Mixture of Experts via Domain Aware Fine-grained Aggregation
Ziwei Zhan, Wenkuan Zhao, Yuanqing Li, Weijie Liu, Xiaoxi Zhang, Chee Wei Tan, Chuan Wu, Deke Guo, and Xu Chen

TL;DR
FedMoE-DA introduces a federated learning framework using Mixture of Experts with domain-aware aggregation and peer-to-peer synchronization, improving personalization, robustness, and reducing communication costs.
Contribution
This work presents a novel federated learning method combining MoE architecture with domain-aware fine-grained aggregation and P2P communication to enhance efficiency and personalization.
Findings
Reduces communication overhead significantly.
Achieves high model performance and personalization.
Enhances robustness against data heterogeneity.
Abstract
Federated learning (FL) is a collaborative machine learning approach that enables multiple clients to train models without sharing their private data. With the rise of deep learning, large-scale models have garnered significant attention due to their exceptional performance. However, a key challenge in FL is the limitation imposed by clients with constrained computational and communication resources, which hampers the deployment of these large models. The Mixture of Experts (MoE) architecture addresses this challenge with its sparse activation property, which reduces computational workload and communication demands during inference and updates. Additionally, MoE facilitates better personalization by allowing each expert to specialize in different subsets of the data distribution. To alleviate the communication burdens between the server and clients, we propose FedMoE-DA, a new FL model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Human Mobility and Location-Based Analysis · Mobile Crowdsensing and Crowdsourcing
MethodsSoftmax · Attention Is All You Need · Mixture of Experts
