Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking
Yuan Yao, Jin Song, Jian Jin

TL;DR
NeuralMark introduces a hashed watermark filter for neural network watermarking that enhances robustness against forging, overwriting, fine-tuning, and pruning attacks across diverse architectures and tasks.
Contribution
It presents a novel hashed watermark filter method that significantly improves the security and robustness of weight-based neural network watermarking.
Findings
Effective against forging and overwriting attacks
Resistant to fine-tuning and pruning
Validated across 13 architectures and multiple tasks
Abstract
As valuable digital assets, deep neural networks necessitate robust ownership protection, positioning neural network watermarking (NNW) as a promising solution. Among various NNW approaches, weight-based methods are favored for their simplicity and practicality; however, they remain vulnerable to forging and overwriting attacks. To address those challenges, we propose NeuralMark, a robust method built around a hashed watermark filter. Specifically, we utilize a hash function to generate an irreversible binary watermark from a secret key, which is then used as a filter to select the model parameters for embedding. This design cleverly intertwines the embedding parameters with the hashed watermark, providing a robust defense against both forging and overwriting attacks. An average pooling is also incorporated to resist fine-tuning and pruning attacks. Furthermore, it can be seamlessly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
MethodsDropout · Label Smoothing · Byte Pair Encoding · Pruning · Absolute Position Encodings · Layer Normalization · Dense Connections · Softmax · Transformer
