SSFL: Discovering Sparse Unified Subnetworks at Initialization for Efficient Federated Learning

Riyasat Ohib; Bishal Thapaliya; Gintare Karolina Dziugaite; Jingyu Liu; Vince Calhoun; Sergey Plis

arXiv:2405.09037 · cs.LG · January 16, 2026

TL;DR

SSFL identifies a sparse subnetwork at initialization and trains only that subnetwork in federated learning, cutting communication costs by 2x relative to dense federated learning and improving accuracy over sparse baselines on non-IID data.

Contribution

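The paper presents SSFL, which finds a sparse subnetwork before training by aggregating parameter saliency scores, computed on each client's local data, into a single global mask. Training and communicating only the masked weights improves both communication efficiency and accuracy in federated learning with non-IID data.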

Findings

01 Achieves more than 20% relative error reduction on CIFAR-10 compared to the strongest sparse baseline.

02 Reduces communication costs by 2x relative to dense federated learning.

03 Delivers over 2.3x faster communication time in a real-world federated learning deployment.

Abstract

In this work, we propose Salient Sparse Federated Learning (SSFL), a streamlined approach for sparse federated learning with efficient communication. SSFL identifies a sparse subnetwork prior to training: parameter saliency scores are computed separately on each client's local data in non-IID scenarios and then aggregated to determine a global mask. Only the sparse model weights are trained and communicated each round between the clients and the server. On standard benchmarks including CIFAR-10, CIFAR-100, and Tiny-ImageNet, SSFL consistently improves the accuracy-sparsity trade-off, achieving more than 20% relative error reduction on CIFAR-10 compared to the strongest sparse baseline, while reducing communication costs by 2x relative to dense FL. Finally, in a real-world federated learning deployment, SSFL delivers over 2.3x faster communication time, underscoring…
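The mask-discovery step described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract only says "parameter saliency scores", so the score |w · g| used here is an assumed SNIP-style choice, weighting each client by its share of the data is likewise an assumption, and all function names (`local_saliency`, `global_mask`, `pack`, `unpack`) are hypothetical. Parameters are treated as a flat list of floats to keep the example dependency-free.

```python
import random

def local_saliency(weights, grads):
    """Per-parameter saliency on one client: |w * g| (assumed score, not from the source)."""
    return [abs(w * g) for w, g in zip(weights, grads)]

def global_mask(client_scores, client_sizes, density):
    """Aggregate client scores and keep the top `density` fraction of parameters."""
    total = float(sum(client_sizes))
    n = len(client_scores[0])
    agg = [0.0] * n
    for scores, size in zip(client_scores, client_sizes):
        for i, s in enumerate(scores):
            agg[i] += (size / total) * s          # assumed weighting: client data share
    k = max(1, round(density * n))                # number of weights to keep
    ranked = sorted(range(n), key=agg.__getitem__, reverse=True)
    keep = set(ranked[:k])                        # indices of the k largest scores
    return [i in keep for i in range(n)]

def pack(weights, mask):
    """Payload for one round: only the weights selected by the shared mask."""
    return [w for w, m in zip(weights, mask) if m]

def unpack(values, mask):
    """Rebuild a dense parameter list; pruned positions stay at zero."""
    it = iter(values)
    return [next(it) if m else 0.0 for m in mask]

random.seed(0)
n_params = 20
w0 = [random.gauss(0.0, 1.0) for _ in range(n_params)]        # shared initialization
client_grads = [[random.gauss(0.0, 1.0) for _ in range(n_params)]
                for _ in range(3)]                            # stand-ins for local gradients
scores = [local_saliency(w0, g) for g in client_grads]
mask = global_mask(scores, client_sizes=[100, 300, 600], density=0.5)
payload = pack(w0, mask)
restored = unpack(payload, mask)
```

Because the mask is fixed before training and shared by every client and the server, each round only needs to carry the values in `payload`; the positions are implied by the mask. How the real system encodes and transmits these values is not stated in the abstract.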

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Graph Neural Networks