Joint Optimization of Prompt Security and System Performance in   Edge-Cloud LLM Systems

Haiyang Huang; Tianhui Meng; Weijia Jia

arXiv:2501.18663·cs.CR·February 3, 2025

Joint Optimization of Prompt Security and System Performance in Edge-Cloud LLM Systems

Haiyang Huang, Tianhui Meng, Weijia Jia

PDF

Open Access

TL;DR

This paper presents a joint optimization approach for prompt security, latency, and resource use in Edge-Cloud LLM systems, using a Bayesian game model and a lightweight attack detector to improve safety and efficiency.

Contribution

It introduces a novel multi-stage Bayesian game model for joint prompt security, latency, and resource optimization, along with a vector-database-enabled attack detector for EC-LLM systems.

Findings

01

Enhanced security against prompt attacks

02

Reduced latency for benign users

03

Lower system resource consumption

Abstract

Large language models (LLMs) have significantly facilitated human life, and prompt engineering has improved the efficiency of these models. However, recent years have witnessed a rise in prompt engineering-empowered attacks, leading to issues such as privacy leaks, increased latency, and system resource wastage. Though safety fine-tuning based methods with Reinforcement Learning from Human Feedback (RLHF) are proposed to align the LLMs, existing security mechanisms fail to cope with fickle prompt attacks, highlighting the necessity of performing security detection on prompts. In this paper, we jointly consider prompt security, service latency, and system resource optimization in Edge-Cloud LLM (EC-LLM) systems under various prompt attacks. To enhance prompt security, a vector-database-enabled lightweight attack detector is proposed. We formalize the problem of joint prompt detection,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing · Software-Defined Networks and 5G

Methodstravel james · ALIGN