GaussMark: A Practical Approach for Structural Watermarking of Language Models
Adam Block, Ayush Sekhari, and Alexander Rakhlin

TL;DR
GaussMark introduces a practical, statistically guaranteed structural watermarking method for large language models by embedding a secret signal into model weights through Gaussian noise, ensuring detectability without impacting performance.
Contribution
This work presents GaussMark, a novel weight-based watermarking scheme for LLMs that is simple, efficient, and offers formal statistical guarantees, addressing limitations of previous token-level methods.
Findings
Watermark detection is statistically reliable with formal bounds.
Watermarking does not degrade model quality.
Method is robust to common text corruptions.
Abstract
Recent advances in Large Language Models (LLMs) have led to significant improvements in natural language processing tasks, but their ability to generate human-quality text raises significant ethical and operational concerns in settings where it is important to recognize whether or not a given text was generated by a human. Thus, recent work has focused on developing techniques for watermarking LLM-generated text, i.e., introducing an almost imperceptible signal that allows a provider equipped with a secret key to determine if given text was generated by their model. Current watermarking techniques are often not practical due to concerns with generation latency, detection time, degradation in text quality, or robustness. Many of these drawbacks come from the focus on token-level watermarking, which ignores the inherent structure of text. In this work, we introduce a new scheme,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Handwritten Text Recognition Techniques · Human Motion and Animation
MethodsFocus
