HO-SFL: Hybrid-Order Split Federated Learning with Backprop-Free Clients and Dimension-Free Aggregation

Qiyuan Chen; Xian Wu; Yi Wang; Xianhao Chen

arXiv:2603.14773·cs.LG·March 17, 2026

HO-SFL: Hybrid-Order Split Federated Learning with Backprop-Free Clients and Dimension-Free Aggregation

Qiyuan Chen, Xian Wu, Yi Wang, Xianhao Chen

PDF

Open Access

TL;DR

HO-SFL introduces a hybrid federated learning framework that combines first-order server updates with zeroth-order client optimization, reducing memory and communication costs while maintaining fast convergence.

Contribution

This work presents a novel hybrid split federated learning method that decouples client and server optimization, enabling dimension-free aggregation and improved convergence.

Findings

01

HO-SFL reduces client memory usage by eliminating backpropagation.

02

HO-SFL achieves convergence speeds comparable to first-order methods.

03

HO-SFL significantly lowers communication costs in federated learning.

Abstract

Fine-tuning large models on edge devices is severely hindered by the memory-intensive backpropagation (BP) in standard frameworks like federated learning and split learning. While substituting BP with zeroth-order optimization can significantly reduce memory footprints, it typically suffers from prohibitively degraded convergence speed. To resolve this dilemma, we propose Hybrid-Order Split Federated Learning (HO-SFL). By reformulating the split learning process within a Lagrangian framework, HO-SFL decouples the optimization landscape: The server performs precise first-order updates (i.e., BP), whereas clients conduct memory-efficient zeroth-order optimization. This hybrid design not only eliminates the need for client-side BP but also enables dimension-free model aggregation, drastically lowering communication costs. Crucially, we provide a theoretical convergence analysis,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications