SEAL: Subspace-Anchored Watermarks for LLM Ownership
Yanbo Dai, Zongjie Li, Zhenlan Ji, Shuai Wang

TL;DR
SEAL introduces a novel subspace-anchored watermarking method for LLMs that embeds multi-bit signatures into model representations, ensuring robust, stealthy, and verifiable ownership protection without impairing model performance.
Contribution
This work presents the first subspace-anchored watermarking framework for LLMs, enabling effective ownership verification in both white-box and black-box settings while resisting knowledgeable attacks.
Findings
SEAL outperforms 11 existing methods in effectiveness, fidelity, and robustness.
The watermark remains intact even under model fine-tuning and knowledge distillation.
SEAL maintains high verification accuracy against adversaries with full knowledge of the watermarking process.
Abstract
Large language models (LLMs) have achieved remarkable success across a wide range of natural language processing tasks, demonstrating human-level performance in text generation, reasoning, and question answering. However, training such models requires substantial computational resources, large curated datasets, and sophisticated alignment procedures. As a result, they constitute highly valuable intellectual property (IP) assets that warrant robust protection mechanisms. Existing IP protection approaches suffer from critical limitations. Model fingerprinting techniques can identify model architectures but fail to establish ownership of specific model instances. In contrast, traditional backdoor-based watermarking methods embed behavioral anomalies that can be easily removed through common post-processing operations such as fine-tuning or knowledge distillation. We propose SEAL, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Graph Neural Networks · Advanced Malware Detection Techniques
