KinGuard: Hierarchical Kinship-Aware Fingerprinting to Defend Against Large Language Model Stealing

Zhenhua Xu; Xiaoning Tian; Wenjun Zeng; Wenpeng Xing; Tianliang Lu; Gaolei Li; Chaochao Chen; and Meng Han

arXiv:2601.12986·cs.CR·January 22, 2026

KinGuard: Hierarchical Kinship-Aware Fingerprinting to Defend Against Large Language Model Stealing

Zhenhua Xu, Xiaoning Tian, Wenjun Zeng, Wenpeng Xing, Tianliang Lu, Gaolei Li, Chaochao Chen, and Meng Han

PDF

Open Access

TL;DR

KinGuard introduces a novel kinship-aware fingerprinting framework that embeds structured knowledge into language models, enabling robust, stealthy, and resilient ownership verification against various attacks.

Contribution

The paper proposes KinGuard, a knowledge-based embedding approach that overcomes the limitations of traditional backdoor fingerprinting for LLM ownership verification.

Findings

01

KinGuard outperforms existing methods in effectiveness and stealth.

02

It remains resilient against fine-tuning, input perturbation, and model merging attacks.

03

Knowledge embedding provides a secure paradigm for model fingerprinting.

Abstract

Protecting the intellectual property of large language models requires robust ownership verification. Conventional backdoor fingerprinting, however, is flawed by a stealth-robustness paradox: to be robust, these methods force models to memorize fixed responses to high-perplexity triggers, but this targeted overfitting creates detectable statistical artifacts. We resolve this paradox with KinGuard, a framework that embeds a private knowledge corpus built on structured kinship narratives. Instead of memorizing superficial triggers, the model internalizes this knowledge via incremental pre-training, and ownership is verified by probing its conceptual understanding. Extensive experiments demonstrate KinGuard's superior effectiveness, stealth, and resilience against a battery of attacks including fine-tuning, input perturbation, and model merging. Our work establishes knowledge-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Topic Modeling