Protecting Intellectual Property of Language Generation APIs with   Lexical Watermark

Xuanli He; Qiongkai Xu; Lingjuan Lyu; Fangzhao Wu; Chenguang Wang

arXiv:2112.02701·cs.CR·December 7, 2021

Protecting Intellectual Property of Language Generation APIs with Lexical Watermark

Xuanli He, Qiongkai Xu, Lingjuan Lyu, Fangzhao Wu, Chenguang Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a lexical watermarking technique to protect the intellectual property of natural language generation APIs by embedding identifiable watermarks into generated text, effectively deterring model extraction attacks.

Contribution

The work presents a novel lexical modification-based watermarking method for NLG APIs that outperforms existing techniques in detectability, semantic preservation, and human interpretability.

Findings

01

Watermarks are more detectable with lower semantic loss.

02

The approach is effective across different domains.

03

It remains robust even when attackers train on mixed data with minimal watermarked samples.

Abstract

Nowadays, due to the breakthrough in natural language generation (NLG), including machine translation, document summarization, image captioning, etc NLG models have been encapsulated in cloud APIs to serve over half a billion people worldwide and process over one hundred billion word generations per day. Thus, NLG APIs have already become essential profitable services in many commercial companies. Due to the substantial financial and intellectual investments, service providers adopt a pay-as-you-use policy to promote sustainable market growth. However, recent works have shown that cloud platforms suffer from financial losses imposed by model extraction attacks, which aim to imitate the functionality and utility of the victim services, thus violating the intellectual property (IP) of cloud APIs. This work targets at protecting IP of NLG APIs by identifying the attackers who have utilized…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xlhex/nlg_api_watermark
pytorchOfficial

Videos

Protecting Intellectual Property of Language Generation APIs with Lexical Watermark· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Security and Verification in Computing

Methodstravel james