Straggler-resilient Federated Learning: Tackling Computation   Heterogeneity with Layer-wise Partial Model Training in Mobile Edge Network

Hongda Wu; Ping Wang; C V Aswartha Narayana

arXiv:2311.10002·cs.LG·November 17, 2023·2 cites

Straggler-resilient Federated Learning: Tackling Computation Heterogeneity with Layer-wise Partial Model Training in Mobile Edge Network

Hongda Wu, Ping Wang, C V Aswartha Narayana

PDF

Open Access

TL;DR

This paper introduces FedPMT, a federated learning method that allows resource-limited devices to train partial models, improving efficiency and accommodating heterogeneity in device capabilities.

Contribution

The paper proposes FedPMT, a novel partial model training approach for heterogeneous federated learning, with theoretical convergence guarantees and empirical superiority over benchmarks.

Findings

01

FedPMT outperforms FedDrop in empirical tests.

02

FedPMT achieves faster learning than FedAvg.

03

Theoretical analysis confirms similar convergence rate to FedAvg.

Abstract

Federated Learning (FL) enables many resource-limited devices to train a model collaboratively without data sharing. However, many existing works focus on model-homogeneous FL, where the global and local models are the same size, ignoring the inherently heterogeneous computational capabilities of different devices and restricting resource-constrained devices from contributing to FL. In this paper, we consider model-heterogeneous FL and propose Federated Partial Model Training (FedPMT), where devices with smaller computational capabilities work on partial models (subsets of the global model) and contribute to the global model. Different from Dropout-based partial model generation, which removes neurons in hidden layers at random, model training in FedPMT is achieved from the back-propagation perspective. As such, all devices in FedPMT prioritize the most crucial parts of the global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data

MethodsFocus