Resource Allocation for Stable LLM Training in Mobile Edge Computing
Chang Liu, Jun Zhao

TL;DR
This paper proposes a resource-efficient collaborative training framework for large language models in mobile edge computing, combining parameter-efficient fine-tuning with optimization techniques to reduce energy, delay, and improve stability.
Contribution
It introduces a novel multi-objective optimization approach integrating PEFT methods for stable LLM training in mobile edge environments.
Findings
Reduces energy consumption during training
Lowers latency in model deployment
Enhances model stability and reliability
Abstract
As mobile devices increasingly become focal points for advanced applications, edge computing presents a viable solution to their inherent computational limitations, particularly in deploying large language models (LLMs). However, despite the advancements in edge computing, significant challenges remain in efficient training and deploying LLMs due to the computational demands and data privacy concerns associated with these models. This paper explores a collaborative training framework that integrates mobile users with edge servers to optimize resource allocation, thereby enhancing both performance and efficiency. Our approach leverages parameter-efficient fine-tuning (PEFT) methods, allowing mobile users to adjust the initial layers of the LLM while edge servers handle the more demanding latter layers. Specifically, we formulate a multi-objective optimization problem to minimize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management
