HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning   Framework for Large Language Models

Zheng Lin; Yuxin Zhang; Zhe Chen; Zihan Fang; Xianhao Chen; Praneeth; Vepakomma; Wei Ni; Jun Luo; Yue Gao

arXiv:2505.02795·cs.LG·May 6, 2025

HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models

Zheng Lin, Yuxin Zhang, Zhe Chen, Zihan Fang, Xianhao Chen, Praneeth, Vepakomma, Wei Ni, Jun Luo, Yue Gao

PDF

Open Access

TL;DR

HSplitLoRA is a novel framework that enables efficient, heterogeneous fine-tuning of large language models on devices with varying resources by combining split learning and low-rank adaptation techniques.

Contribution

It introduces a dynamic, resource-aware fine-tuning method that adapts model splits and adapter ranks for heterogeneous devices, improving efficiency and performance.

Findings

01

Outperforms existing benchmarks in accuracy and convergence speed.

02

Effectively handles device heterogeneity in federated fine-tuning.

03

Reduces computational costs for large language model adaptation.

Abstract

Recently, large language models (LLMs) have achieved remarkable breakthroughs, revolutionizing the natural language processing domain and beyond. Due to immense parameter sizes, fine-tuning these models with private data for diverse downstream tasks has become mainstream. Though federated learning (FL) offers a promising solution for fine-tuning LLMs without sharing raw data, substantial computing costs hinder its democratization. Moreover, in real-world scenarios, private client devices often possess heterogeneous computing resources, further complicating LLM fine-tuning. To combat these challenges, we propose HSplitLoRA, a heterogeneous parameter-efficient fine-tuning (PEFT) framework built on split learning (SL) and low-rank adaptation (LoRA) fine-tuning, for efficiently fine-tuning LLMs on heterogeneous client devices. HSplitLoRA first identifies important weights based on their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsAdapter