Don't Forget to Sign the Gradients!
Omid Aramoon, Pin-Yu Chen, Gang Qu

TL;DR
GradSigns is a novel watermarking framework for deep neural networks that embeds signatures into input gradients, offering robustness against attacks and minimal performance impact, thus protecting model intellectual property.
Contribution
This paper introduces GradSigns, a new method for watermarking DNNs by embedding signatures into input gradients, enhancing robustness and capacity over existing techniques.
Findings
GradSigns is robust against all known watermark removal attacks.
It can embed a large amount of information into DNNs.
The approach has negligible impact on model performance.
Abstract
Engineering a top-notch deep learning model is an expensive procedure that involves collecting data, hiring human resources with expertise in machine learning, and providing high computational resources. For that reason, deep learning models are considered as valuable Intellectual Properties (IPs) of the model vendors. To ensure reliable commercialization of deep learning models, it is crucial to develop techniques to protect model vendors against IP infringements. One of such techniques that recently has shown great promise is digital watermarking. However, current watermarking approaches can embed very limited amount of information and are vulnerable against watermark removal attacks. In this paper, we present GradSigns, a novel watermarking framework for deep neural networks (DNNs). GradSigns embeds the owner's signature into the gradient of the cross-entropy cost function with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Internet Traffic Analysis and Secure E-voting
