HFedMoE: Resource-aware Heterogeneous Federated Learning with Mixture-of-Experts
Zihan Fang, Zheng Lin, Senkang Hu, Yanan Ma, Yihang Tao, Yiqin Deng, Xianhao Chen, Yuguang Fang

TL;DR
HFedMoE introduces a resource-aware federated learning framework using a mixture-of-experts approach, enabling efficient large language model fine-tuning on resource-constrained devices by adaptive expert selection and importance-weighted aggregation.
Contribution
The paper proposes HFedMoE, a novel framework that dynamically selects and aggregates experts in federated learning to address resource heterogeneity and improve performance.
Findings
HFedMoE achieves higher training accuracy than benchmarks.
HFedMoE converges faster during training.
The framework effectively handles diverse client resources.
Abstract
While federated learning (FL) enables fine-tuning of large language models (LLMs) without compromising data privacy, the substantial size of an LLM renders on-device training impractical for resource-constrained clients, such as mobile devices. Thus, Mixture-of-Experts (MoE) models have emerged as a computation-efficient solution, which activates only a sparse subset of experts during model training to reduce computing burden without sacrificing performance. Though integrating MoE into FL fine-tuning holds significant potential, it still encounters three key challenges: i) selecting appropriate experts for clients remains challenging due to the lack of a reliable metric to measure each expert's impact on local fine-tuning performance, ii) the heterogeneous computing resources across clients severely hinder MoE-based LLM fine-tuning, as dynamic expert activations across diverse input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Big Data and Digital Economy
