SWAP: Towards Copyright Auditing of Soft Prompts via Sequential Watermarking
Wenyuan Yang, Yichen Sun, Changzheng Chen, Zhixuan Chu, Jiaheng Zhang, Yiming Li, Dacheng Tao

TL;DR
This paper introduces SWAP, a novel sequential watermarking method for soft prompts in vision-language models like CLIP, enabling effective copyright auditing without harming model performance.
Contribution
The paper proposes SWAP, a new watermarking technique that embeds watermarks in a complex space to improve copyright protection of soft prompts.
Findings
SWAP effectively detects protected soft prompts across multiple datasets.
SWAP is robust against adaptive attacks and does not affect model accuracy.
Existing auditing techniques are ineffective for soft prompts due to their unique learning characteristics.
Abstract
Large-scale vision-language models, especially CLIP, have demonstrated remarkable performance across diverse downstream tasks. Soft prompts, as carefully crafted modules that efficiently adapt vision-language models to specific tasks, necessitate effective copyright protection. In this paper, we investigate model copyright protection by auditing whether suspicious third-party models incorporate protected soft prompts. While this can be viewed as a special case of model ownership auditing, our analysis shows that existing techniques are ineffective due to prompt learning's unique characteristics. Non-intrusive auditing is inherently prone to false positives when independent models share similar data distributions with victim models. Intrusive approaches also fail: backdoor methods designed for CLIP cannot embed functional triggers, while extending traditional DNN backdoor techniques to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
