Protecting Intellectual Property of Language Generation APIs with Lexical Watermark
Xuanli He, Qiongkai Xu, Lingjuan Lyu, Fangzhao Wu, Chenguang Wang

TL;DR
This paper introduces a lexical watermarking technique to protect the intellectual property of natural language generation APIs by embedding identifiable watermarks into generated text, effectively deterring model extraction attacks.
Contribution
The work presents a novel lexical modification-based watermarking method for NLG APIs that outperforms existing techniques in detectability, semantic preservation, and human interpretability.
Findings
Watermarks are more detectable with lower semantic loss.
The approach is effective across different domains.
It remains robust even when attackers train on mixed data with minimal watermarked samples.
Abstract
Nowadays, due to the breakthrough in natural language generation (NLG), including machine translation, document summarization, image captioning, etc NLG models have been encapsulated in cloud APIs to serve over half a billion people worldwide and process over one hundred billion word generations per day. Thus, NLG APIs have already become essential profitable services in many commercial companies. Due to the substantial financial and intellectual investments, service providers adopt a pay-as-you-use policy to promote sustainable market growth. However, recent works have shown that cloud platforms suffer from financial losses imposed by model extraction attacks, which aim to imitate the functionality and utility of the victim services, thus violating the intellectual property (IP) of cloud APIs. This work targets at protecting IP of NLG APIs by identifying the attackers who have utilized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Security and Verification in Computing
Methodstravel james
