SplitCom: Communication-efficient Split Federated Fine-tuning of LLMs via Temporal Compression
Tao Li, Yulin Tang, Yiyang Song, Cong Wu, Xihui Liu, Pan Li, Xianhao Chen

TL;DR
SplitCom introduces a novel communication-efficient split federated learning framework for LLMs that leverages temporal activation compression, significantly reducing communication costs while maintaining model performance.
Contribution
The paper proposes SplitCom, a new SFL framework that uses temporal redundancy and adaptive control to minimize communication overhead in federated LLM training.
Findings
Reduces uplink communication by up to 98.6%
Decreases total communication costs by up to 95.8%
Maintains comparable model performance
Abstract
Federated fine-tuning of on-device large language models (LLMs) mitigates privacy concerns by preventing raw data sharing. However, the intensive computational and memory demands pose significant challenges for resource-constrained edge devices. To overcome these limitations, split federated learning (SFL) emerges as a promising solution that partitions the model into lightweight client-side and compute-intensive server-side sub-models, thus offloading the primary training workload to a powerful server. Nevertheless, high-dimensional activation exchanges in SFL lead to excessive communication overhead. To overcome this, we propose SplitCom, a communication-efficient SFL framework for LLMs that exploits temporal redundancy in activations across consecutive training epochs. Inspired by video compression, the core innovation of our framework lies in selective activation uploading only when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · IoT and Edge/Fog Computing · Big Data and Digital Economy
