Adaptive White-Box Watermarking with Self-Mutual Check Parameters in   Deep Neural Networks

Zhenzhe Gao; Zhaoxia Yin; Hongjian Zhan; Heng Yin; Yue Lu

arXiv:2308.11235·cs.CR·September 6, 2023·1 cites

Adaptive White-Box Watermarking with Self-Mutual Check Parameters in Deep Neural Networks

Zhenzhe Gao, Zhaoxia Yin, Hongjian Zhan, Heng Yin, Yue Lu

PDF

Open Access

TL;DR

This paper introduces an adaptive white-box watermarking method for deep neural networks that detects, locates, and restores tampered parameters, improving security and accuracy retention under attack conditions.

Contribution

It presents a novel adaptive embedding technique combined with self-mutual check parameters for precise tampering detection and model restoration.

Findings

01

Achieved high recovery performance with modification rates below 20%

02

Recovered over 15% of accuracy loss in models with significant watermarking impact

03

Effective in detecting and restoring tampered parameters in neural networks

Abstract

Artificial Intelligence (AI) has found wide application, but also poses risks due to unintentional or malicious tampering during deployment. Regular checks are therefore necessary to detect and prevent such risks. Fragile watermarking is a technique used to identify tampering in AI models. However, previous methods have faced challenges including risks of omission, additional information transmission, and inability to locate tampering precisely. In this paper, we propose a method for detecting tampered parameters and bits, which can be used to detect, locate, and restore parameters that have been tampered with. We also propose an adaptive embedding method that maximizes information capacity while maintaining model accuracy. Our approach was tested on multiple neural networks subjected to attacks that modified weight parameters, and our results demonstrate that our method achieved great…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Neural Network Applications