When MiniBatch SGD Meets SplitFed Learning:Convergence Analysis and Performance Evaluation
Chao Huang, Geng Tian, Ming Tang

TL;DR
This paper introduces MiniBatch-SFL, a novel split federated learning algorithm that combines MiniBatch SGD to improve convergence and mitigate client drift, especially with non-IID data, leading to significant accuracy gains.
Contribution
The paper proposes MiniBatch-SFL, integrating MiniBatch SGD into split federated learning to enhance convergence analysis and performance with non-IID data.
Findings
MiniBatch-SFL achieves up to 24.1% higher accuracy with non-IID data.
Server-side updates are unaffected by data non-IIDness, aiding convergence.
A later cut layer position improves gradient divergence and overall performance.
Abstract
Federated learning (FL) enables collaborative model training across distributed clients (e.g., edge devices) without sharing raw data. Yet, FL can be computationally expensive as the clients need to train the entire model multiple times. SplitFed learning (SFL) is a recent distributed approach that alleviates computation workload at the client device by splitting the model at a cut layer into two parts, where clients only need to train part of the model. However, SFL still suffers from the \textit{client drift} problem when clients' data are highly non-IID. To address this issue, we propose MiniBatch-SFL. This algorithm incorporates MiniBatch SGD into SFL, where the clients train the client-side model in an FL fashion while the server trains the server-side model similar to MiniBatch SGD. We analyze the convergence of MiniBatch-SFL and show that the bound of the expected loss can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Traffic Prediction and Management Techniques
MethodsStochastic Gradient Descent
