Adaptive White-Box Watermarking with Self-Mutual Check Parameters in Deep Neural Networks
Zhenzhe Gao, Zhaoxia Yin, Hongjian Zhan, Heng Yin, Yue Lu

TL;DR
This paper introduces an adaptive white-box watermarking method for deep neural networks that detects, locates, and restores tampered parameters, improving security and accuracy retention under attack conditions.
Contribution
It presents a novel adaptive embedding technique combined with self-mutual check parameters for precise tampering detection and model restoration.
Findings
Achieved high recovery performance with modification rates below 20%
Recovered over 15% of accuracy loss in models with significant watermarking impact
Effective in detecting and restoring tampered parameters in neural networks
Abstract
Artificial Intelligence (AI) has found wide application, but also poses risks due to unintentional or malicious tampering during deployment. Regular checks are therefore necessary to detect and prevent such risks. Fragile watermarking is a technique used to identify tampering in AI models. However, previous methods have faced challenges including risks of omission, additional information transmission, and inability to locate tampering precisely. In this paper, we propose a method for detecting tampered parameters and bits, which can be used to detect, locate, and restore parameters that have been tampered with. We also propose an adaptive embedding method that maximizes information capacity while maintaining model accuracy. Our approach was tested on multiple neural networks subjected to attacks that modified weight parameters, and our results demonstrate that our method achieved great…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Neural Network Applications
