Fast AI Model Partition for Split Learning over Edge Networks
Zuguang Li, Wen Wu, Shaohua Wu, and Xuemin (Sherman) Shen

TL;DR
This paper introduces a graph-based optimal model partitioning method for split learning that minimizes training delay and improves efficiency over edge networks.
Contribution
It formulates model partitioning as a minimum s-t cut problem, providing a low-complexity algorithm that significantly reduces training delay and computation time.
Findings
Reduces training delay by up to 38.95% compared to baselines.
Cuts algorithm running time by up to 13.0 times.
Validates effectiveness on NVIDIA Jetson hardware testbed.
Abstract
Split learning (SL) is a distributed learning paradigm that can enable computation-intensive artificial intelligence (AI) applications by partitioning AI models between mobile devices and edge servers. %fully utilizing distributed computing resources for computation-intensive mobile intelligence applications. However, the model partitioning problem in SL becomes challenging due to the diverse and complex architectures of AI models. In this paper, we formulate an optimal model partitioning problem to minimize training delay in SL. To solve the problem, we represent an arbitrary AI model as a directed acyclic graph (DAG), where the model's layers and inter-layer connections are mapped to vertices and edges, and training delays are captured as edge weights. Then, we propose a general model partitioning algorithm by transforming the problem into a minimum \textit{s-t} cut problem on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
