SEAL: Subspace-Anchored Watermarks for LLM Ownership

Yanbo Dai; Zongjie Li; Zhenlan Ji; Shuai Wang

arXiv:2511.11356·cs.CR·November 17, 2025

SEAL: Subspace-Anchored Watermarks for LLM Ownership

Yanbo Dai, Zongjie Li, Zhenlan Ji, Shuai Wang

PDF

Open Access

TL;DR

SEAL introduces a novel subspace-anchored watermarking method for LLMs that embeds multi-bit signatures into model representations, ensuring robust, stealthy, and verifiable ownership protection without impairing model performance.

Contribution

This work presents the first subspace-anchored watermarking framework for LLMs, enabling effective ownership verification in both white-box and black-box settings while resisting knowledgeable attacks.

Findings

01

SEAL outperforms 11 existing methods in effectiveness, fidelity, and robustness.

02

The watermark remains intact even under model fine-tuning and knowledge distillation.

03

SEAL maintains high verification accuracy against adversaries with full knowledge of the watermarking process.

Abstract

Large language models (LLMs) have achieved remarkable success across a wide range of natural language processing tasks, demonstrating human-level performance in text generation, reasoning, and question answering. However, training such models requires substantial computational resources, large curated datasets, and sophisticated alignment procedures. As a result, they constitute highly valuable intellectual property (IP) assets that warrant robust protection mechanisms. Existing IP protection approaches suffer from critical limitations. Model fingerprinting techniques can identify model architectures but fail to establish ownership of specific model instances. In contrast, traditional backdoor-based watermarking methods embed behavioral anomalies that can be easily removed through common post-processing operations such as fine-tuning or knowledge distillation. We propose SEAL, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Advanced Malware Detection Techniques