k-SemStamp: A Clustering-Based Semantic Watermark for Detection of   Machine-Generated Text

Abe Bohan Hou; Jingyu Zhang; Yichen Wang; Daniel Khashabi; Tianxing He

arXiv:2402.11399·cs.CL·June 11, 2024·2 cites

k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text

Abe Bohan Hou, Jingyu Zhang, Yichen Wang, Daniel Khashabi, Tianxing He

PDF

Open Access 2 Repos 1 Video

TL;DR

k-SemStamp introduces a clustering-based semantic watermarking method that enhances robustness and efficiency in detecting machine-generated text, outperforming previous LSH-based approaches.

Contribution

It replaces LSH with k-means clustering for semantic partitioning, improving robustness and sampling efficiency in watermark detection.

Findings

01

k-SemStamp outperforms LSH-based SemStamp in robustness.

02

It maintains generation quality while improving detection efficiency.

03

Experimental results confirm enhanced detection performance.

Abstract

Recent watermarked generation algorithms inject detectable signatures during language generation to facilitate post-hoc detection. While token-level watermarks are vulnerable to paraphrase attacks, SemStamp (Hou et al., 2023) applies watermark on the semantic representation of sentences and demonstrates promising robustness. SemStamp employs locality-sensitive hashing (LSH) to partition the semantic space with arbitrary hyperplanes, which results in a suboptimal tradeoff between robustness and speed. We propose k-SemStamp, a simple yet effective enhancement of SemStamp, utilizing k-means clustering as an alternative of LSH to partition the embedding space with awareness of inherent semantic structure. Experimental results indicate that k-SemStamp saliently improves its robustness and sampling efficiency while preserving the generation quality, advancing a more effective tool for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text· underline

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · User Authentication and Security Systems · Internet Traffic Analysis and Secure E-voting

Methodsk-Means Clustering