Fast AI Model Partition for Split Learning over Edge Networks

Zuguang Li; Wen Wu; Shaohua Wu; and Xuemin (Sherman) Shen

arXiv:2507.01041·cs.LG·April 15, 2026

Fast AI Model Partition for Split Learning over Edge Networks

Zuguang Li, Wen Wu, Shaohua Wu, and Xuemin (Sherman) Shen

PDF

TL;DR

This paper introduces a graph-based optimal model partitioning method for split learning that minimizes training delay and improves efficiency over edge networks.

Contribution

It formulates model partitioning as a minimum s-t cut problem, providing a low-complexity algorithm that significantly reduces training delay and computation time.

Findings

01

Reduces training delay by up to 38.95% compared to baselines.

02

Cuts algorithm running time by up to 13.0 times.

03

Validates effectiveness on NVIDIA Jetson hardware testbed.

Abstract

Split learning (SL) is a distributed learning paradigm that can enable computation-intensive artificial intelligence (AI) applications by partitioning AI models between mobile devices and edge servers. %fully utilizing distributed computing resources for computation-intensive mobile intelligence applications. However, the model partitioning problem in SL becomes challenging due to the diverse and complex architectures of AI models. In this paper, we formulate an optimal model partitioning problem to minimize training delay in SL. To solve the problem, we represent an arbitrary AI model as a directed acyclic graph (DAG), where the model's layers and inter-layer connections are mapped to vertices and edges, and training delays are captured as edge weights. Then, we propose a general model partitioning algorithm by transforming the problem into a minimum \textit{s-t} cut problem on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.