Ampere: Communication-Efficient and High-Accuracy Split Federated Learning
Zihan Zhang, Leon Wong, Blesson Varghese

TL;DR
Ampere introduces a communication-efficient split federated learning system that enhances model accuracy and reduces computation and communication costs by using unidirectional training and auxiliary networks, especially effective with non-IID data.
Contribution
It proposes a novel unidirectional inter-block training method with auxiliary networks, significantly reducing communication overhead and improving accuracy over existing split federated learning approaches.
Findings
Up to 13.26% accuracy improvement
Up to 94.6% reduction in training time
Up to 99.1% decrease in communication overhead
Abstract
A Federated Learning (FL) system collaboratively trains neural networks across devices and a server but is limited by significant on-device computation costs. Split Federated Learning (SFL) systems mitigate this by offloading a block of layers of the network from the device to a server. However, in doing so, it introduces large communication overheads due to frequent exchanges of intermediate activations and gradients between devices and the server and reduces model accuracy for non-IID data. We propose Ampere, a novel collaborative training system that simultaneously minimizes on-device computation and device-server communication while improving model accuracy. Unlike SFL, which uses a global loss by iterative end-to-end training, Ampere develops unidirectional inter-block training to sequentially train the device and server block with a local loss, eliminating the transfer of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Big Data and Digital Economy · Advanced Data and IoT Technologies
