Joint Optimization of Prompt Security and System Performance in Edge-Cloud LLM Systems
Haiyang Huang, Tianhui Meng, Weijia Jia

TL;DR
This paper presents a joint optimization approach for prompt security, latency, and resource use in Edge-Cloud LLM systems, using a Bayesian game model and a lightweight attack detector to improve safety and efficiency.
Contribution
It introduces a novel multi-stage Bayesian game model for joint prompt security, latency, and resource optimization, along with a vector-database-enabled attack detector for EC-LLM systems.
Findings
Enhanced security against prompt attacks
Reduced latency for benign users
Lower system resource consumption
Abstract
Large language models (LLMs) have significantly facilitated human life, and prompt engineering has improved the efficiency of these models. However, recent years have witnessed a rise in prompt engineering-empowered attacks, leading to issues such as privacy leaks, increased latency, and system resource wastage. Though safety fine-tuning based methods with Reinforcement Learning from Human Feedback (RLHF) are proposed to align the LLMs, existing security mechanisms fail to cope with fickle prompt attacks, highlighting the necessity of performing security detection on prompts. In this paper, we jointly consider prompt security, service latency, and system resource optimization in Edge-Cloud LLM (EC-LLM) systems under various prompt attacks. To enhance prompt security, a vector-database-enabled lightweight attack detector is proposed. We formalize the problem of joint prompt detection,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Software-Defined Networks and 5G
Methodstravel james · ALIGN
