CyclicFL: A Cyclic Model Pre-Training Approach to Efficient Federated   Learning

Pengyu Zhang; Yingbo Zhou; Ming Hu; Xian Wei; and Mingsong Chen

arXiv:2301.12193·cs.LG·September 6, 2024·1 cites

CyclicFL: A Cyclic Model Pre-Training Approach to Efficient Federated Learning

Pengyu Zhang, Yingbo Zhou, Ming Hu, Xian Wei, and Mingsong Chen

PDF

Open Access

TL;DR

CyclicFL is a novel federated learning pre-training approach that enhances convergence speed and accuracy by deriving effective initial models without exposing local data, especially benefiting non-IID scenarios.

Contribution

The paper introduces CyclicFL, a cyclic pre-training method for FL that improves convergence and accuracy without requiring public data, and provides theoretical analysis of its effectiveness.

Findings

01

Up to 14.11% increase in classification accuracy.

02

Significantly faster convergence in FL training.

03

Effective on non-IID data scenarios.

Abstract

Federated learning (FL) has been proposed to enable distributed learning on Artificial Intelligence Internet of Things (AIoT) devices with guarantees of high-level data privacy. Since random initial models in FL can easily result in unregulated Stochastic Gradient Descent (SGD) processes, existing FL methods greatly suffer from both slow convergence and poor accuracy, especially in non-IID scenarios. To address this problem, we propose a novel method named CyclicFL, which can quickly derive effective initial models to guide the SGD processes, thus improving the overall FL training performance. We formally analyze the significance of data consistency between the pre-training and training stages of CyclicFL, showing the limited Lipschitzness of loss for the pre-trained models by CyclicFL. Moreover, we systematically prove that our method can achieve faster convergence speed under various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Internet Traffic Analysis and Secure E-voting

MethodsStochastic Gradient Descent