EditMF: Drawing an Invisible Fingerprint for Your Large Language Models
Jiaxuan Wu, Yinghan Zhou, Wanli Peng, Yiming Xue, Juan Wen, Ping Zhong

TL;DR
EditMF introduces a training-free, highly imperceptible fingerprinting method for large language models that ensures ownership verification with minimal impact on model performance and high robustness against attacks.
Contribution
The paper presents EditMF, a novel training-free fingerprinting approach that embeds ownership marks into LLMs using minimal, semantically coherent modifications, improving stealth and efficiency over prior methods.
Findings
High imperceptibility of embedded fingerprints
Negligible performance loss on LLMs
Robustness surpassing LoRA-based methods
Abstract
Training large language models (LLMs) is resource-intensive and expensive, making protecting intellectual property (IP) for LLMs crucial. Recently, embedding fingerprints into LLMs has emerged as a prevalent method for establishing model ownership. However, existing back-door-based methods suffer from limited stealth and efficiency. To simultaneously address these issues, we propose EditMF, a training-free fingerprinting paradigm that achieves highly imperceptible fingerprint embedding with minimal computational overhead. Ownership bits are mapped to compact, semantically coherent triples drawn from an encrypted artificial knowledge base (e.g., virtual author-novel-protagonist facts). Causal tracing localizes the minimal set of layers influencing each triple, and a zero-space update injects the fingerprint without perturbing unrelated knowledge. Verification requires only a single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Privacy-Preserving Technologies in Data
