Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model
Runtao Ren, Yinyu Wu, Xuhui Zhang, Jinke Ren, Yanyan Shen, and Shuqiang Wang, Kim-Fung Tsang

TL;DR
This paper introduces a retrieval-augmented generation approach using large language models to optimize resource allocation in mobile edge computing, significantly reducing latency in dynamic multi-user scenarios.
Contribution
It presents a novel LLM-enabled retrieval mechanism for real-time resource allocation in MEC, addressing scalability, context-awareness, and interpretability issues of prior methods.
Findings
Achieves up to 86% latency reduction in dynamic MEC scenarios.
Demonstrates effectiveness across multi-user and multi-task environments.
Outperforms existing DL-based approaches in various settings.
Abstract
The rapid evolution of mobile edge computing (MEC) has introduced significant challenges in optimizing resource allocation in highly dynamic wireless communication systems, in which task offloading decisions should be made in real-time. However, existing resource allocation strategies cannot well adapt to the dynamic and heterogeneous characteristics of MEC systems, since they are short of scalability, context-awareness, and interpretability. To address these issues, this paper proposes a novel retrieval-augmented generation (RAG) method to improve the performance of MEC systems. Specifically, a latency minimization problem is first proposed to jointly optimize the data offloading ratio, transmit power allocation, and computing resource allocation. Then, an LLM-enabled information-retrieval mechanism is proposed to solve the problem efficiently. Extensive experiments across multi-user,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling
