Digital Watermarking for Deep Neural Networks
Yuki Nagai, Yusuke Uchida, Shigeyuki Sakazawa, Shin'ichi Satoh

TL;DR
This paper introduces a novel digital watermarking method for deep neural networks, enabling ownership verification without impairing performance, even after fine-tuning or pruning, thus protecting intellectual property in AI models.
Contribution
It proposes a general framework for embedding watermarks into neural network parameters during training, fine-tuning, or distillation, ensuring robustness and non-intrusiveness.
Findings
Watermarks remain intact after fine-tuning and pruning.
The method does not impair network performance.
Watermarks survive up to 65% parameter pruning.
Abstract
Although deep neural networks have made tremendous progress in the area of multimedia representation, training neural models requires a large amount of data and time. It is well-known that utilizing trained models as initial weights often achieves lower training error than neural networks that are not pre-trained. A fine-tuning step helps to reduce both the computational cost and improve performance. Therefore, sharing trained models has been very important for the rapid progress of research and development. In addition, trained models could be important assets for the owner(s) who trained them, hence we regard trained models as intellectual property. In this paper, we propose a digital watermarking technology for ownership authorization of deep neural networks. First, we formulate a new problem: embedding watermarks into deep neural networks. We also define requirements, embedding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
