DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models
Bita Darvish Rouhani, Huili Chen, Farinaz Koushanfar

TL;DR
DeepSigns introduces a universal watermarking framework for protecting the intellectual property of deep learning models by embedding signatures into the models' internal data distributions, resilient against various attacks.
Contribution
It presents the first generic watermarking methodology applicable in both white-box and black-box scenarios for deep learning models.
Findings
Effective watermark embedding in multiple model architectures.
Resilience against model compression, fine-tuning, and overwriting.
Validated on MNIST and CIFAR10 datasets.
Abstract
Deep Learning (DL) models have caused a paradigm shift in our ability to comprehend raw data in various important fields, ranging from intelligence warfare and healthcare to autonomous transportation and automated manufacturing. A practical concern, in the rush to adopt DL models as a service, is protecting the models against Intellectual Property (IP) infringement. The DL models are commonly built by allocating significant computational resources that process vast amounts of proprietary training data. The resulting models are therefore considered to be the IP of the model builder and need to be protected to preserve the owner's competitive advantage. This paper proposes DeepSigns, a novel end-to-end IP protection framework that enables insertion of coherent digital watermarks in contemporary DL models. DeepSigns, for the first time, introduces a generic watermarking methodology that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Steganography and Watermarking Techniques · Digital Media Forensic Detection
