Reversible Watermarking in Deep Convolutional Neural Networks for Integrity Authentication
Xiquan Guan, Huamin Feng, Weiming Zhang, Hang Zhou, Jie Zhang, and, Nenghai Yu

TL;DR
This paper introduces a reversible watermarking method for deep convolutional neural networks that enables integrity authentication without permanently altering the model, allowing full recovery and detection of unauthorized modifications.
Contribution
It proposes a novel reversible watermarking algorithm based on model pruning and histogram shifting, suitable for verifying model integrity while preserving original parameters.
Findings
Embedding has less than 0.5% impact on classification accuracy
Model parameters can be fully recovered after watermark extraction
Model integrity can be effectively verified through watermark comparison
Abstract
Deep convolutional neural networks have made outstanding contributions in many fields such as computer vision in the past few years and many researchers published well-trained network for downloading. But recent studies have shown serious concerns about integrity due to model-reuse attacks and backdoor attacks. In order to protect these open-source networks, many algorithms have been proposed such as watermarking. However, these existing algorithms modify the contents of the network permanently and are not suitable for integrity authentication. In this paper, we propose a reversible watermarking algorithm for integrity authentication. Specifically, we present the reversible watermarking problem of deep convolutional neural networks and utilize the pruning theory of model compression technology to construct a host sequence used for embedding watermarking information by histogram shift.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
