SPATL: Salient Parameter Aggregation and Transfer Learning for   Heterogeneous Clients in Federated Learning

Sixing Yu; Phuong Nguyen; Waqwoya Abebe; Wei Qian; Ali Anwar; Ali; Jannesari

arXiv:2111.14345·cs.LG·August 29, 2022·5 cites

SPATL: Salient Parameter Aggregation and Transfer Learning for Heterogeneous Clients in Federated Learning

Sixing Yu, Phuong Nguyen, Waqwoya Abebe, Wei Qian, Ali Anwar, Ali, Jannesari

PDF

Open Access 1 Repo

TL;DR

SPATL is a federated learning method that reduces communication costs, accelerates inference, and improves training stability by selecting salient parameters, splitting models for heterogeneity, and controlling gradients.

Contribution

It introduces a salient parameter selection, model splitting for heterogeneous clients, and gradient control, advancing federated learning efficiency and robustness.

Findings

01

Reduces communication cost by up to 86.45%

02

Speeds up local inference by up to 39.7% FLOPs

03

Requires 7.4 times less communication overhead for ResNet-20

Abstract

Federated learning~(FL) facilitates the training and deploying AI models on edge devices. Preserving user data privacy in FL introduces several challenges, including expensive communication costs, limited resources, and data heterogeneity. In this paper, we propose SPATL, an FL method that addresses these issues by: (a) introducing a salient parameter selection agent and communicating selected parameters only; (b) splitting a model into a shared encoder and a local predictor, and transferring its knowledge to heterogeneous clients via the locally customized predictor. Additionally, we leverage a gradient control mechanism to further speed up model convergence and increase robustness of training processes. Experiments demonstrate that SPATL reduces communication overhead, accelerates model inference, and enables stable training processes with better results compared to state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yusx-swapp/spatl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Vehicular Ad Hoc Networks (VANETs)

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings