DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection

Yuliang Yan; Haochun Tang; Shuo Yan; Enyan Dai

arXiv:2505.16530·cs.CR·February 3, 2026

DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection

Yuliang Yan, Haochun Tang, Shuo Yan, Enyan Dai

PDF

Open Access 1 Repo 1 Video

TL;DR

DuFFin is a dual-level fingerprinting framework designed to protect the intellectual property of large language models by accurately verifying ownership in black-box settings through trigger patterns and knowledge-level fingerprints.

Contribution

It introduces a novel dual-level fingerprinting approach that effectively verifies LLM ownership without impacting text generation or requiring white-box access.

Findings

01

Achieves IP-ROC > 0.95 in experiments

02

Works across various model types and modifications

03

Effective in black-box ownership verification

Abstract

Large language models (LLMs) are considered valuable Intellectual Properties (IP) for legitimate owners due to the enormous computational cost of training. It is crucial to protect the IP of LLMs from malicious stealing or unauthorized deployment. Despite existing efforts in watermarking and fingerprinting LLMs, these methods either impact the text generation process or are limited in white-box access to the suspect model, making them impractical. Hence, we propose DuFFin, a novel $Du$ al-Level $Fin$ gerprinting $F$ ramework for black-box setting ownership verification. DuFFin extracts the trigger pattern and the knowledge-level fingerprints to identify the source of a suspect model. We conduct experiments on a variety of models collected from the open-source website, including four popular base models as protected LLMs and their fine-tuning, quantization, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuliangyan0807/llm-fingerprint
pytorchOfficial

Videos

DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Physical Unclonable Functions (PUFs) and Hardware Security

MethodsBalanced Selection