Robust Multi-bit Natural Language Watermarking through Invariant   Features

KiYoon Yoo; Wonhyuk Ahn; Jiho Jang; Nojun Kwak

arXiv:2305.01904·cs.CL·June 12, 2023·5 cites

Robust Multi-bit Natural Language Watermarking through Invariant Features

KiYoon Yoo, Wonhyuk Ahn, Jiho Jang, Nojun Kwak

PDF

Open Access 1 Repo

TL;DR

This paper proposes a robust multi-bit natural language watermarking method that leverages invariant features and a corruption-resistant infill model to enhance robustness against minor corruptions, aiding copyright protection.

Contribution

It introduces a novel watermarking framework that improves robustness by 16.8% using invariant feature analysis and a new infill model for natural language content.

Findings

01

Achieved +16.8% robustness improvement over previous methods

02

Effective watermark extraction despite minor corruptions

03

Applicable across multiple datasets and corruption types

Abstract

Recent years have witnessed a proliferation of valuable original natural language contents found in subscription-based media outlets, web novel platforms, and outputs of large language models. However, these contents are susceptible to illegal piracy and potential misuse without proper security measures. This calls for a secure watermarking system to guarantee copyright protection through leakage tracing or ownership identification. To effectively combat piracy and protect copyrights, a multi-bit watermarking framework should be able to embed adequate bits of information and extract the watermarks in a robust manner despite possible corruption. In this work, we explore ways to advance both payload and robustness by following a well-known proposition from image watermarking and identify features in natural language that are invariant to minor corruption. Through a systematic analysis of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bangawayoo/nlp-watermarking
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Spam and Phishing Detection