Adversarial Frontier Stitching for Remote Neural Network Watermarking
Erwan Le Merrer, Patrick Perez, Gilles Tr\'edan

TL;DR
This paper introduces a novel method for watermarking remote neural network models by subtly altering their decision boundaries, enabling watermark extraction through minimal queries without affecting model performance.
Contribution
It proposes a zero-bit watermarking algorithm using adversarial examples that allows remote model ownership verification via API queries, a novel approach in model protection.
Findings
Effective watermark embedding with minimal performance loss
Watermark can be extracted with few API queries
Applicable to image classification neural networks
Abstract
The state of the art performance of deep learning models comes at a high cost for companies and institutions, due to the tedious data collection and the heavy processing requirements. Recently, [35, 22] proposed to watermark convolutional neural networks for image classification, by embedding information into their weights. While this is a clear progress towards model protection, this technique solely allows for extracting the watermark from a network that one accesses locally and entirely. Instead, we aim at allowing the extraction of the watermark from a neural network (or any other machine learning model) that is operated remotely, and available through a service API. To this end, we propose to mark the model's action itself, tweaking slightly its decision frontiers so that a set of specific queries convey the desired information. In the present paper, we formally introduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
