CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment

Qinfeng Li; Tianyue Luo; Xuhong Zhang; Yangfan Xie; Zhiqiang Shen; Lijun Zhang; Yier Jin; Hao Peng; Xinkui Zhao; Xianwei Zhu; and Jianwei Yin

arXiv:2410.13903 · cs.CR · April 28, 2026

TL;DR

CoreGuard is a novel, efficient protection protocol designed to safeguard proprietary large language models deployed on edge devices from model stealing and fine-tuning attacks, with minimal overhead.

Contribution

The paper introduces CoreGuard, a computation- and communication-efficient method that protects edge-deployed LLMs against model extraction and misuse.

Findings

01 CoreGuard achieves near-maximum security protection.

02 It incurs negligible computational and communication overhead.

03 Extensive experiments validate its effectiveness and efficiency.

Abstract

Proprietary large language models (LLMs) exhibit strong generalization capabilities across diverse tasks and are increasingly deployed on edge devices for efficiency and privacy reasons. However, deploying proprietary LLMs at the edge without adequate protection introduces critical security threats. Attackers can extract model weights and architectures, enabling unauthorized copying and misuse. Even when protective measures prevent full extraction of the weights, attackers may still mount more advanced attacks, such as fine-tuning, to exploit the model further. Existing defenses against these threats typically incur significant computational and communication overhead, making them impractical for edge deployment. To safeguard edge-deployed LLMs, we introduce CoreGuard, a computation- and communication-efficient protection method. CoreGuard employs an efficient protection protocol to…

Videos

CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment · slideslive