LLM-Slice: Dedicated Wireless Network Slicing for Large Language Models

Boyi Liu; Jingwen Tong; Jun Zhang

arXiv:2410.18499·cs.NI·October 25, 2024

LLM-Slice: Dedicated Wireless Network Slicing for Large Language Models

Boyi Liu, Jingwen Tong, Jun Zhang

PDF

Open Access

TL;DR

LLM-Slice introduces dedicated wireless network slices for large language models, significantly improving response speed and resource efficiency in wireless environments by tailored resource allocation and control.

Contribution

This work is the first to propose dedicated network slicing for LLMs in wireless networks, optimizing communication resources for faster and more reliable LLM services.

Findings

01

Significant improvement in response speed

02

Enhanced resource utilization

03

Reduced disconnections

Abstract

The rapid adoption of large language models (LLMs) presents new challenges for existing network architectures due to significant peak traffic and high communication uncertainty. Traditional wireless networks struggle to support efficiently, leading to intolerable response delays, disconnections, and resource wastage. To address these issues, we propose LLM-Slice, the first system to provide dedicated communication slices for LLMs within a wireless network environment. By creating LLM-specific network slices, LLM-Slice efficiently binds services with communication resources. Based on user equipment (UE) requests and a permissions database, the system registers specific slices to offer controllable LLM services, integrating a downlink resource control module to optimize response speed, enhance resource utilization, and reduce disconnections. By deploying and validating in a real UE-gNB-CN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIPv6, Mobility, Handover, Networks, Security · Mobile Ad Hoc Networks · Network Security and Intrusion Detection