Federated Dynamic Sparse Training: Computing Less, Communicating Less, Yet Learning Better
Sameer Bibikar, Haris Vikalo, Zhangyang Wang, Xiaohan Chen

TL;DR
FedDST introduces a dynamic sparse training framework for federated learning, significantly reducing computation and communication costs while improving accuracy through adaptive sparse sub-networks.
Contribution
This paper presents FedDST, a federated learning framework in which each client dynamically extracts and trains a sparse sub-network of the full model, reducing both on-device computation and in-network communication relative to dense training while improving accuracy.
Findings
On non-iid CIFAR-10, FedDST gains a 10% accuracy advantage over FedAvgM at the same upload data cap.
Dynamic sparsity accommodates local heterogeneity across FL agents better than fixed, shared sparse masks.
FedDST improves training efficiency and accuracy even with bandwidth constraints.
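The idea of comparing methods "at the same upload data cap" can be made concrete with a rough cost model. The sketch below is illustrative only: the 4-byte values, the 1-bit-per-parameter mask encoding, and all function names are assumptions for this example and are not taken from the paper.

```python
def dense_upload_bytes(num_params: int, bytes_per_value: int = 4) -> int:
    """Bytes a client uploads per round when it sends every parameter."""
    return num_params * bytes_per_value


def sparse_upload_bytes(num_params: int, density: float,
                        bytes_per_value: int = 4) -> int:
    """Bytes per round when only nonzero values plus a bit mask are sent.

    Assumed encoding (hypothetical): one bit per parameter for the mask,
    followed by the nonzero values in order.
    """
    nonzero = round(num_params * density)
    mask_bytes = (num_params + 7) // 8  # ceil(num_params / 8)
    return nonzero * bytes_per_value + mask_bytes


def rounds_within_cap(cap_bytes: int, bytes_per_round: int) -> int:
    """Number of full upload rounds that fit under a fixed data cap."""
    return cap_bytes // bytes_per_round


if __name__ == "__main__":
    n = 1_000_000
    cap = 400_000_000
    print(rounds_within_cap(cap, dense_upload_bytes(n)))
    print(rounds_within_cap(cap, sparse_upload_bytes(n, density=0.2)))
```

Under this model, a sparser upload lets a client complete more communication rounds before reaching the cap, which is one way a sparse method can end up ahead of a dense baseline at equal upload budget.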
Abstract
Federated learning (FL) enables distribution of machine learning workloads from the cloud to resource-limited edge devices. Unfortunately, current deep networks remain not only too compute-heavy for inference and training on edge devices, but also too large for communicating updates over bandwidth-constrained networks. In this paper, we develop, implement, and experimentally validate a novel FL framework termed Federated Dynamic Sparse Training (FedDST) by which complex neural networks can be deployed and trained with substantially improved efficiency in both on-device computation and in-network communication. At the core of FedDST is a dynamic process that extracts and trains sparse sub-networks from the target full network. With this scheme, "two birds are killed with one stone:" instead of full models, each client performs efficient training of its own sparse networks, and only…
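The mechanism described in the abstract, where each client trains its own sparse sub-network and only sparse networks are exchanged, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: magnitude-based pruning, random regrowth, unweighted averaging over the clients that hold each parameter, and the names `prune_and_grow` and `aggregate_sparse` are all choices made for this example.

```python
import random


def prune_and_grow(weights, mask, readjust_frac, rng):
    """Adjust a client's mask while keeping the number of active weights fixed.

    Drops the smallest-magnitude active weights, then activates the same
    number of positions that were inactive before pruning.
    """
    active = [i for i, m in enumerate(mask) if m]
    inactive = [i for i, m in enumerate(mask) if not m]
    k = min(int(readjust_frac * len(active)), len(inactive))
    # Prune: the k active weights with the smallest magnitude (assumed criterion).
    to_drop = sorted(active, key=lambda i: abs(weights[i]))[:k]
    # Grow: k previously inactive positions, chosen at random (assumed criterion).
    to_grow = rng.sample(inactive, k)
    new_weights, new_mask = list(weights), list(mask)
    for i in to_drop:
        new_mask[i] = 0
        new_weights[i] = 0.0
    for i in to_grow:
        new_mask[i] = 1
        new_weights[i] = 0.0  # regrown weights start at zero
    return new_weights, new_mask


def aggregate_sparse(client_weights, client_masks):
    """Average each parameter over only the clients whose mask keeps it."""
    n = len(client_weights[0])
    merged = [0.0] * n
    for j in range(n):
        kept = [w[j] for w, m in zip(client_weights, client_masks) if m[j]]
        if kept:
            merged[j] = sum(kept) / len(kept)
    return merged


if __name__ == "__main__":
    rng = random.Random(0)
    w, m = prune_and_grow([0.5, -0.1, 0.0, 0.9, 0.0, 0.0],
                          [1, 1, 0, 1, 0, 0], readjust_frac=0.34, rng=rng)
    print(w, m)
    print(aggregate_sparse([[1.0, 0.0, 3.0], [3.0, 2.0, 0.0]],
                           [[1, 0, 1], [1, 1, 0]]))
```

Because masks can differ between clients, the server in this sketch averages a parameter only over the clients that actually trained it; parameters held by no client stay at zero.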
Taxonomy
Topics: Privacy-Preserving Technologies in Data · Traffic Prediction and Management Techniques · Machine Learning and ELM
Methods: Network On Network
