PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification
Hongwei Yao, Jian Lou, Kui Ren, Zhan Qin

TL;DR
PromptCARE introduces a novel watermarking framework specifically designed for protecting the copyright of prompts used in large language models, ensuring secure verification without compromising performance.
Contribution
This work is the first to develop prompt-specific watermark injection and verification schemes tailored for NLP and LLM prompts, addressing unique challenges in prompt copyright protection.
Findings
Effective watermarking on six benchmark datasets
Robust against various attack scenarios
Harmless and stealthy watermark embedding
Abstract
Large language models (LLMs) have witnessed a meteoric rise in popularity among the general public users over the past few months, facilitating diverse downstream tasks with human-level accuracy and proficiency. Prompts play an essential role in this success, which efficiently adapt pre-trained LLMs to task-specific applications by simply prepending a sequence of tokens to the query texts. However, designing and selecting an optimal prompt can be both expensive and demanding, leading to the emergence of Prompt-as-a-Service providers who profit by providing well-designed prompts for authorized use. With the growing popularity of prompts and their indispensable role in LLM-based services, there is an urgent need to protect the copyright of prompts against unauthorized use. In this paper, we propose PromptCARE, the first framework for prompt copyright protection through watermark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data
