NeFL: Nested Model Scaling for Federated Learning with System   Heterogeneous Clients

Honggu Kang; Seohyeon Cha; Jinwoo Shin; Jongmyeong Lee; Joonhyuk Kang

arXiv:2308.07761·cs.LG·September 11, 2024

NeFL: Nested Model Scaling for Federated Learning with System Heterogeneous Clients

Honggu Kang, Seohyeon Cha, Jinwoo Shin, Jongmyeong Lee, Joonhyuk Kang

PDF

Open Access

TL;DR

NeFL introduces a flexible nested model scaling framework for federated learning that improves performance and resource efficiency across heterogeneous clients by training multiple submodels with decoupled parameters.

Contribution

It proposes a novel nested federated learning framework that allows flexible model subdivision and parameter decoupling to handle system heterogeneity in FL.

Findings

01

Achieves 7.63% performance improvement on CIFAR-100 for the worst-case submodel.

02

Effectively enables resource-constrained devices to participate in FL.

03

Aligns with recent FL advances like pre-trained models and statistical heterogeneity.

Abstract

Federated learning (FL) enables distributed training while preserving data privacy, but stragglers-slow or incapable clients-can significantly slow down the total training time and degrade performance. To mitigate the impact of stragglers, system heterogeneity, including heterogeneous computing and network bandwidth, has been addressed. While previous studies have addressed system heterogeneity by splitting models into submodels, they offer limited flexibility in model architecture design, without considering potential inconsistencies arising from training multiple submodel architectures. We propose nested federated learning (NeFL), a generalized framework that efficiently divides deep neural networks into submodels using both depthwise and widthwise scaling. To address the inconsistency arising from training multiple submodel architectures, NeFL decouples a subset of parameters from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data