TL;DR
This paper introduces GPSL, a scalable parallel split learning method that maintains fixed global batch sizes and improves training stability and accuracy in non-IID data settings, outperforming traditional methods.
Contribution
GPSL is a novel server-driven scheme that fixes global batch size, uses pooled-level proportions for local batch scheduling, and eliminates bias, enhancing scalability and performance in non-IID environments.
Findings
GPSL stabilizes optimization under non-IID splits.
GPSL achieves centralized-like accuracy on CIFAR datasets.
GPSL shortens training time by avoiding data depletion effects.
Abstract
Parallel split learning (PSL) suffers from two intertwined issues: the effective batch size grows with the number of clients, and data that is not identically and independently distributed (non-IID) skews global batches. We present parallel split learning with global sampling (GPSL), a server-driven scheme that fixes the global batch size while computing per-client batch-size schedules using pooled-level proportions. The actual samples are drawn locally without replacement by each selected client. This eliminates per-class rounding, decouples the effective batch from the client count, and makes each global batch distributionally equivalent to centralized uniform sampling without replacement. Consequently, we obtain finite-population deviation guarantees via Serfling's inequality, yielding a zero rounding bias compared to local sampling schemes. GPSL is a drop-in replacement for PSL with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
