PromptCARE: Prompt Copyright Protection by Watermark Injection and   Verification

Hongwei Yao; Jian Lou; Kui Ren; Zhan Qin

arXiv:2308.02816·cs.MM·November 29, 2023·1 cites

PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification

Hongwei Yao, Jian Lou, Kui Ren, Zhan Qin

PDF

Open Access 1 Repo

TL;DR

PromptCARE introduces a novel watermarking framework specifically designed for protecting the copyright of prompts used in large language models, ensuring secure verification without compromising performance.

Contribution

This work is the first to develop prompt-specific watermark injection and verification schemes tailored for NLP and LLM prompts, addressing unique challenges in prompt copyright protection.

Findings

01

Effective watermarking on six benchmark datasets

02

Robust against various attack scenarios

03

Harmless and stealthy watermark embedding

Abstract

Large language models (LLMs) have witnessed a meteoric rise in popularity among the general public users over the past few months, facilitating diverse downstream tasks with human-level accuracy and proficiency. Prompts play an essential role in this success, which efficiently adapt pre-trained LLMs to task-specific applications by simply prepending a sequence of tokens to the query texts. However, designing and selecting an optimal prompt can be both expensive and demanding, leading to the emergence of Prompt-as-a-Service providers who profit by providing well-designed prompts for authorized use. With the growing popularity of prompts and their indispensable role in LLM-based services, there is an urgent need to protect the copyright of prompts against unauthorized use. In this paper, we propose PromptCARE, the first framework for prompt copyright protection through watermark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

grasses/promptcare
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data