SplitFT: An Adaptive Federated Split Learning System For LLMs Fine-Tuning
Yimeng Shan, Zhaorui Zhang, Sheng Di, Yu Liu, Xiaoyi Lu, Benben Liu

TL;DR
SplitFT is an adaptive federated split learning system designed for efficient LLM fine-tuning, addressing heterogeneity and communication challenges in distributed environments.
Contribution
It introduces adaptive cutlayer selection and LoRA rank reduction to improve efficiency and performance in federated LLM fine-tuning.
Findings
Outperforms state-of-the-art methods in fine-tuning efficiency.
Effectively handles data and device heterogeneity.
Reduces communication overhead during training.
Abstract
Federated Split Learning has been identified as an efficient approach to address the computational resource constraints of clients in classical federated learning, while guaranteeing data privacy for distributed model training across data owners. However, it faces some critical challenges when such a training strategy meets large language models (LLMs) for fine-tuning. Such challenges include setting the cutlayer adaptively across different clients to address the data and device heterogeneity issues, which affect the system performance significantly. In addition, efficiently reducing the communication overhead during the fine-tuning procedure is also another challenge. No work tries to address these challenges. To bridge this gap, we propose SplitTF, an adaptive federated split learning system for LLMs fine-tuning. SplitFT enables different clients to set different cut layers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
