EditMF: Drawing an Invisible Fingerprint for Your Large Language Models

Jiaxuan Wu; Yinghan Zhou; Wanli Peng; Yiming Xue; Juan Wen; Ping Zhong

arXiv:2508.08836·cs.CR·August 13, 2025

EditMF: Drawing an Invisible Fingerprint for Your Large Language Models

Jiaxuan Wu, Yinghan Zhou, Wanli Peng, Yiming Xue, Juan Wen, Ping Zhong

PDF

Open Access

TL;DR

EditMF introduces a training-free, highly imperceptible fingerprinting method for large language models that ensures ownership verification with minimal impact on model performance and high robustness against attacks.

Contribution

The paper presents EditMF, a novel training-free fingerprinting approach that embeds ownership marks into LLMs using minimal, semantically coherent modifications, improving stealth and efficiency over prior methods.

Findings

01

High imperceptibility of embedded fingerprints

02

Negligible performance loss on LLMs

03

Robustness surpassing LoRA-based methods

Abstract

Training large language models (LLMs) is resource-intensive and expensive, making protecting intellectual property (IP) for LLMs crucial. Recently, embedding fingerprints into LLMs has emerged as a prevalent method for establishing model ownership. However, existing back-door-based methods suffer from limited stealth and efficiency. To simultaneously address these issues, we propose EditMF, a training-free fingerprinting paradigm that achieves highly imperceptible fingerprint embedding with minimal computational overhead. Ownership bits are mapped to compact, semantically coherent triples drawn from an encrypted artificial knowledge base (e.g., virtual author-novel-protagonist facts). Causal tracing localizes the minimal set of layers influencing each triple, and a zero-space update injects the fingerprint without perturbing unrelated knowledge. Verification requires only a single…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Privacy-Preserving Technologies in Data