Beyond A Fixed Seal: Adaptive Stealing Watermark in Large Language Models

Shuhao Zhang; Yuli Chen; Jiale Han; Bo Cheng; Jiabao Ma

arXiv:2604.10893·cs.CR·April 14, 2026

Beyond A Fixed Seal: Adaptive Stealing Watermark in Large Language Models

Shuhao Zhang, Yuli Chen, Jiale Han, Bo Cheng, Jiabao Ma

PDF

1 Repo

TL;DR

This paper introduces Adaptive Stealing, a flexible watermark attack method for large language models that dynamically selects attack strategies to improve efficiency against watermarked texts.

Contribution

The paper proposes a novel adaptive stealing algorithm with position-based and dynamic selection modules, enhancing attack effectiveness over fixed strategies.

Findings

01

AS significantly improves steal efficiency against watermarks.

02

Dynamic perspective selection outperforms fixed strategies.

03

Code is publicly available for future research.

Abstract

Watermarking provides a critical safeguard for large language model (LLM) services by facilitating the detection of LLM-generated text. Correspondingly, stealing watermark algorithms (SWAs) derive watermark information from watermarked texts generated by victim LLMs to craft highly targeted adversarial attacks, which compromise the reliability of watermarks. Existing SWAs rely on fixed strategies, overlooking the non-uniform distribution of stolen watermark information and the dynamic nature of real-world LLM generation processes. To address these limitations, we propose Adaptive Stealing (AS), a novel SWA featuring enhanced design flexibility through Position-Based Seal Construction and Adaptive Selection modules. AS operates by defining multiple attack perspectives derived from distinct activation states of contextually ordered tokens. During attack execution, AS dynamically selects…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DrankXs/AdaptiveStealingWatermark
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.