How Vulnerable Are Edge LLMs?

Ao Ding; Hongzong Li; Zi Liang; Zhanpeng Shi; Shuxin Zhuang; Shiqin Tang; Rong Feng; Ping Lu

arXiv:2603.23822·cs.CR·March 26, 2026

How Vulnerable Are Edge LLMs?

Ao Ding, Hongzong Li, Zi Liang, Zhanpeng Shi, Shuxin Zhuang, Shiqin Tang, Rong Feng, Ping Lu

PDF

Open Access

TL;DR

This paper investigates the security vulnerabilities of quantized edge-deployed large language models, revealing that their semantic knowledge can be substantially extracted through structured queries despite quantization noise.

Contribution

It introduces CLIQ, a novel structured query framework that enhances knowledge extraction efficiency from quantized LLMs, highlighting security risks in edge deployments.

Findings

01

Quantization does not prevent semantic knowledge extraction.

02

CLIQ outperforms original queries in multiple evaluation metrics.

03

Quantization alone is insufficient for protecting LLMs against extraction attacks.

Abstract

Large language models (LLMs) are increasingly deployed on edge devices under strict computation and quantization constraints, yet their security implications remain unclear. We study query-based knowledge extraction from quantized edge-deployed LLMs under realistic query budgets and show that, although quantization introduces noise, it does not remove the underlying semantic knowledge, allowing substantial behavioral recovery through carefully designed queries. To systematically analyze this risk, we propose \textbf{CLIQ} (\textbf{Cl}ustered \textbf{I}nstruction \textbf{Q}uerying), a structured query construction framework that improves semantic coverage while reducing redundancy. Experiments on quantized Qwen models (INT8/INT4) demonstrate that CLIQ consistently outperforms original queries across BERTScore, BLEU, and ROUGE, enabling more efficient extraction under limited budgets.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Big Data and Digital Economy · Topic Modeling