ProFLingo: A Fingerprinting-based Intellectual Property Protection Scheme for Large Language Models
Heng Jin, Chaoyu Zhang, Shanghao Shi, Wenjing Lou, Y., Thomas Hou

TL;DR
ProFLingo introduces a novel black-box fingerprinting method for protecting the intellectual property of large language models, enabling owners to verify model derivation without modifying or accessing the suspect model.
Contribution
This paper presents the first black-box fingerprinting scheme for LLMs that does not require model modifications or prior knowledge of the suspect model.
Findings
Effective in identifying derived models
Non-invasive and model-agnostic approach
Open-source implementation available
Abstract
Large language models (LLMs) have attracted significant attention in recent years. Due to their "Large" nature, training LLMs from scratch consumes immense computational resources. Since several major players in the artificial intelligence (AI) field have open-sourced their original LLMs, an increasing number of individuals and smaller companies are able to build derivative LLMs based on these open-sourced models at much lower costs. However, this practice opens up possibilities for unauthorized use or reproduction that may not comply with licensing agreements, and fine-tuning can change the model's behavior, thus complicating the determination of model ownership. Current intellectual property (IP) protection schemes for LLMs are either designed for white-box settings or require additional modifications to the original model, which restricts their use in real-world settings. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Digital Rights Management and Security · Hate Speech and Cyberbullying Detection
MethodsBalanced Selection
