Lost in Overlap: Exploring Logit-based Watermark Collision in LLMs

Yiyang Luo; Ke Lin; Chao Gu; Jiahui Hou; Lijie Wen; Ping Luo

arXiv:2403.10020·cs.CL·February 6, 2025·1 cites

Lost in Overlap: Exploring Logit-based Watermark Collision in LLMs

Yiyang Luo, Ke Lin, Chao Gu, Jiahui Hou, Lijie Wen, Ping Luo

PDF

Open Access

TL;DR

This paper investigates the issue of watermark collision in logit-based watermarking methods for LLMs, revealing its threat to copyright protection and proposing it as a new attack strategy.

Contribution

It introduces watermark collision as a novel attack concept that undermines logit-based watermarking across various LLM tasks and applications.

Findings

01

Watermark collision significantly reduces watermark detection accuracy.

02

It poses a universal threat to all logit-based watermark algorithms.

03

Watermark collision impacts downstream applications and content integrity.

Abstract

The proliferation of large language models (LLMs) in generating content raises concerns about text copyright. Watermarking methods, particularly logit-based approaches, embed imperceptible identifiers into text to address these challenges. However, the widespread usage of watermarking across diverse LLMs has led to an inevitable issue known as watermark collision during common tasks, such as paraphrasing or translation. In this paper, we introduce watermark collision as a novel and general philosophy for watermark attacks, aimed at enhancing attack performance on top of any other attacking methods. We also provide a comprehensive demonstration that watermark collision poses a threat to all logit-based watermark algorithms, impacting not only specific attack scenarios but also downstream applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Rights Management and Security · Auction Theory and Applications · Blockchain Technology Applications and Security