SPD-CFL: Stepwise Parameter Dropout for Efficient Continual Federated Learning
Yuning Yang, Han Yu, Chuan Sun, Tianrun Gao, Xiaohong Liu, Xiaodong Xu, Ping Zhang, and Guangyu Wang

TL;DR
SPD-CFL introduces a stepwise parameter dropout method for efficient continual federated learning, enabling adaptive performance-targeted model compression with reduced communication costs and improved accuracy.
Contribution
It proposes a novel adaptive dropout approach that automatically finds suitable dropout rates based on target performance, enhancing federated learning efficiency.
Findings
Achieves 2.07% higher test AUC on CIFAR-10 and medical datasets.
Reduces communication overhead by 29.53%.
Demonstrates superiority over state-of-the-art methods.
Abstract
Federated Learning (FL) is a collaborative machine learning paradigm for training models on local sensitive data with privacy protection. Pre-trained transformer-based models have emerged as useful foundation models (FMs) to be fine-tuned for a wide range of downstream tasks. However, large-scale pre-trained models make it challenging for traditional FL due to high communication overhead in the resource-constrained IoT. This has inspired the field of parameter-efficient fine-tuning (PEFT) research. Existing PEFT methods attempt to optimize model performance at the given dropout level. Such an approach places the burden on human users to find a dropout rate that provides a satisfactory level of performance through trial-and-error, which is time consuming and resource intensive. To address this limitation, we propose the Step-wise Parameter Dropout for Continual Federated Learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Brain Tumor Detection and Classification · Stochastic Gradient Optimization Techniques
