WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks

Anudeex Shetty; Qiongkai Xu; Jey Han Lau

arXiv:2409.04459·cs.CR·June 3, 2025

WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks

Anudeex Shetty, Qiongkai Xu, Jey Han Lau

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new linear transformation watermarking method for Embeddings-as-a-Service that is robust against paraphrasing attacks, addressing vulnerabilities of existing watermark techniques.

Contribution

A novel linear transformation watermarking technique for EaaS embeddings that resists paraphrasing-based removal, supported by empirical and theoretical analysis.

Findings

01

Existing watermarks can be removed by paraphrasing attacks.

02

The proposed linear transformation watermark is robust against paraphrasing.

03

The method is supported by both empirical experiments and theoretical proofs.

Abstract

Embeddings-as-a-Service (EaaS) is a service offered by large language model (LLM) developers to supply embeddings generated by LLMs. Previous research suggests that EaaS is prone to imitation attacks -- attacks that clone the underlying EaaS model by training another model on the queried embeddings. As a result, EaaS watermarks are introduced to protect the intellectual property of EaaS providers. In this paper, we first show that existing EaaS watermarks can be removed by paraphrasing when attackers clone the model. Subsequently, we propose a novel watermarking technique that involves linearly transforming the embeddings, and show that it is empirically and theoretically robust against paraphrasing.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anudeex/wet
pytorchOfficial

Videos

WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks· underline

Taxonomy

TopicsSecurity and Verification in Computing · Internet Traffic Analysis and Secure E-voting · Advanced Malware Detection Techniques

Methodstravel james