SWAP: Towards Copyright Auditing of Soft Prompts via Sequential Watermarking

Wenyuan Yang; Yichen Sun; Changzheng Chen; Zhixuan Chu; Jiaheng Zhang; Yiming Li; Dacheng Tao

arXiv:2511.04711·cs.CR·November 10, 2025

SWAP: Towards Copyright Auditing of Soft Prompts via Sequential Watermarking

Wenyuan Yang, Yichen Sun, Changzheng Chen, Zhixuan Chu, Jiaheng Zhang, Yiming Li, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces SWAP, a novel sequential watermarking method for soft prompts in vision-language models like CLIP, enabling effective copyright auditing without harming model performance.

Contribution

The paper proposes SWAP, a new watermarking technique that embeds watermarks in a complex space to improve copyright protection of soft prompts.

Findings

01

SWAP effectively detects protected soft prompts across multiple datasets.

02

SWAP is robust against adaptive attacks and does not affect model accuracy.

03

Existing auditing techniques are ineffective for soft prompts due to their unique learning characteristics.

Abstract

Large-scale vision-language models, especially CLIP, have demonstrated remarkable performance across diverse downstream tasks. Soft prompts, as carefully crafted modules that efficiently adapt vision-language models to specific tasks, necessitate effective copyright protection. In this paper, we investigate model copyright protection by auditing whether suspicious third-party models incorporate protected soft prompts. While this can be viewed as a special case of model ownership auditing, our analysis shows that existing techniques are ineffective due to prompt learning's unique characteristics. Non-intrusive auditing is inherently prone to false positives when independent models share similar data distributions with victim models. Intrusive approaches also fail: backdoor methods designed for CLIP cannot embed functional triggers, while extending traditional DNN backdoor techniques to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks