Straggler-resilient Federated Learning: Tackling Computation Heterogeneity with Layer-wise Partial Model Training in Mobile Edge Network
Hongda Wu, Ping Wang, C V Aswartha Narayana

TL;DR
This paper introduces FedPMT, a federated learning method that allows resource-limited devices to train partial models, improving efficiency and accommodating heterogeneity in device capabilities.
Contribution
The paper proposes FedPMT, a novel partial model training approach for heterogeneous federated learning, with theoretical convergence guarantees and empirical superiority over benchmarks.
Findings
FedPMT outperforms FedDrop in empirical tests.
FedPMT achieves faster learning than FedAvg.
Theoretical analysis confirms similar convergence rate to FedAvg.
Abstract
Federated Learning (FL) enables many resource-limited devices to train a model collaboratively without data sharing. However, many existing works focus on model-homogeneous FL, where the global and local models are the same size, ignoring the inherently heterogeneous computational capabilities of different devices and restricting resource-constrained devices from contributing to FL. In this paper, we consider model-heterogeneous FL and propose Federated Partial Model Training (FedPMT), where devices with smaller computational capabilities work on partial models (subsets of the global model) and contribute to the global model. Different from Dropout-based partial model generation, which removes neurons in hidden layers at random, model training in FedPMT is achieved from the back-propagation perspective. As such, all devices in FedPMT prioritize the most crucial parts of the global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
MethodsFocus
