Energy-Efficient Split Learning for Fine-Tuning Large Language Models in Edge Networks

Zuguang Li, Shaohua Wu, Liang Li, and Songge Zhang

arXiv:2412.00090·cs.LG·January 15, 2025

Open Access

TL;DR

This paper introduces an energy-efficient split learning framework for fine-tuning large language models at the network edge, optimizing delay and energy use across heterogeneous devices.

Contribution

The paper proposes the Cut lAyer and computing Resource Decision (CARD) algorithm, which minimizes training delay and energy consumption while accounting for device heterogeneity and channel dynamics.

Findings

01 Reduces average training delay by 70.8% compared with the benchmarks

02 Lowers the server's energy consumption by 53.1% compared with the benchmarks

03 Targets fine-tuning on geo-distributed personal data in edge networks

Abstract

In this letter, we propose an energy-efficient split learning (SL) framework for fine-tuning large language models (LLMs) using geo-distributed personal data at the network edge, where LLMs are split and alternately trained across a large number of mobile devices and an edge server. Considering the device heterogeneity and channel dynamics in edge networks, a Cut lAyer and computing Resource Decision (CARD) algorithm is developed to minimize training delay and energy consumption. Simulation results demonstrate that the proposed approach reduces the average training delay and the server's energy consumption by 70.8% and 53.1%, respectively, compared with the benchmarks.
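
To make the joint decision described in the abstract more concrete, the following is a minimal illustrative sketch, not the authors' CARD algorithm. It assumes a simple cost model that does not come from the paper: per-layer workloads in FLOPs, activations and gradients of equal size exchanged at the cut layer, server dynamic energy proportional to the square of the clock frequency, and a weighted sum of delay and energy as the objective. All names (`Setup`, `cost`, `choose_cut_and_frequency`) and parameters are hypothetical, and the search is plain exhaustive enumeration.

```python
from dataclasses import dataclass
from itertools import product
from typing import Sequence, Tuple


@dataclass
class Setup:
    """Hypothetical per-round parameters; none of these values come from the paper."""
    layer_flops: Sequence[float]   # training workload of each layer, in FLOPs
    act_bits: Sequence[float]      # act_bits[i-1]: bits exchanged when cutting after layer i
    device_flops_per_s: float      # device computing speed
    rate_bps: float                # device-server link rate (assumed symmetric)
    kappa: float                   # assumed effective switched capacitance of the server
    flops_per_cycle: float         # assumed server FLOPs per clock cycle


def cost(setup: Setup, cut: int, f_server: float, weight: float) -> Tuple[float, float, float]:
    """Return (weighted cost, delay in s, server energy in J) for one candidate decision."""
    device_work = sum(setup.layer_flops[:cut])   # layers 1..cut run on the device
    server_work = sum(setup.layer_flops[cut:])   # remaining layers run on the server
    # Activations go up and gradients come back, both assumed to be the same size.
    comm_delay = 2 * setup.act_bits[cut - 1] / setup.rate_bps
    delay = (device_work / setup.device_flops_per_s
             + comm_delay
             + server_work / (f_server * setup.flops_per_cycle))
    cycles = server_work / setup.flops_per_cycle
    energy = setup.kappa * f_server ** 2 * cycles  # assumed CMOS dynamic-energy model
    return weight * delay + (1 - weight) * energy, delay, energy


def choose_cut_and_frequency(setup: Setup, freqs: Sequence[float],
                             weight: float = 0.5) -> Tuple[int, float]:
    """Exhaustively search all (cut layer, server frequency) pairs for the lowest cost."""
    candidates = product(range(1, len(setup.layer_flops)), freqs)
    return min(candidates, key=lambda c: cost(setup, c[0], c[1], weight)[0])
```

In this sketch, a `weight` close to 1 favors low delay and tends to pick a higher server frequency, while a smaller `weight` favors low server energy. Under channel dynamics, `rate_bps` would change from round to round, so the search would be rerun per device and per round.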

Taxonomy

Topics: Topic Modeling · Advanced Graph Neural Networks